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HUMAN NUCLEOTIDE PYROPHOSPHOHYDROLASE-2 

A portion of the disclosure of this patent document contains material which is 
5 subject to copyright protection. The copyright owner has no objection to the facsimile 
reproduction by any one of the patent document or the patent disclosure, as it appears in the 
Patent and Trademark Office patent file or records, but otherwise reserves all copyright 
rights whatsoever. 

10 TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of a human 
nucleotide pyrophosphohydrolase-2 and to the use of these sequences in the diagnosis, 
prevention, and treatment of arthropathies, immunological disorders, and cancers. 

BACKGROUND 

15 Calcium pyrophosphate dihydrate (CPPD) deposition disease is an arthropathy 

characterized by the accumulation of CPPD crystals in articular tissues including synovial 
fluid. CPPD crystals contribute significantly to the chronic pain and tissue damage of 
joint degeneration, and in vitro they induce neutrophil activation and fibroblast and 
chondrocyte mitogenesis as well as the production of matrix metalloproteinases (MMP) 

20 and prostaglandins. CPPD deposition is associated with acute inflammatory episodes 
(pseudogout); chronic arthritis, and degenerative joint disease. Although only about 10% 
of the CPPD patient population ever experience acute inflammatory attacks, the majority 
of patients with chronic arthritis of the large joints have CPPD deposition. CPPD crystals 
play a significant role in arthritic disease progression. Synovial fluids containing CPPD 

25 crystals sampled from patients with degenerative joint disease have high concentrations of 
cartilage fragments and MMPs, e.g., collagenase and stromelysin. (Swan, A.B. et al. 
(1994) Ann. Rheum, bis: 53:467-470; Lohmander, L. S. et al. (1993) Arthritis. Rheum. 
36:181-189.) 

4 

Deposition of CPPD crystals appears to be related to excess levels of extracellular 
30 calcium, pyrophosphate (PPi), or both. Whereas elevated calcium levels do not appear to 
be a major contributing factor to CPPD deposition in joints, elevated PPi levels have been 
observed in the synovial fluids from patients with CPPD deposition. Synovial fluid PPi 



«n9fli(wi I * 



J 



WO 99732610 PCT/US98/25558 

may be produced by joint tissues because PPi levels are higher in synovial fluid than in 
plasma and in vitro cartilage explants release PPi into the extracellular medium, (kyan, L. 
M et aL (1996) J. Rheumatol. 23:214-219.) 

Enzymes that hydrolyze nucleotide triphosphates and release PPi are called 
5 nucleotide pyrophosphohydrolases (NTPPH). NTPPH activity is found in synovial fluid 
and correlates with the production of PPi. Elevated ATP levels have been found in joint 
fluids of patients with CPPD deposition, and addition of extracellular ATP to joint tissues 
and fluids results in the production of PPi. (Park, W.l. et al. (1996) J. Rheumatol. 23:665- 
671 .) The levels of molecules with NTPPH activity are higher in extracts from cartilage 

1 0 containing CPPD crystals than from cartilage lacking crystals. Matrix vesicles released 
from articular cartilage in vitro show high NTPPH activity and produce CPPD in the 
presence of calcium and ATP. (Derfus, BA. et al. (1992) Arthritis. Rheum. 35:231-240.) 

A protein demonstrating NTPPH activity and having a molecular weight of 61 kD 
was recently purified from porcine articular cartilage explant conditioned medium. 

15 (Masuda, I. et al. (1995) J. Clin. Invest. 95:699-704.) The first 26 residues of the amino- 
terminus were sequenced and showed no homology to any protein in public databases. 
Antipeptide antibodies were generated against the 61 kD porcine protein, and the antisera 
identified the original 61 kD protein and an additional 127 kD vesicle-associated protein in 
conditioned medium from cultures of both chondrocytes and cartilage explants. The 61 

20 kD ispform is believed to be a catalytically active proteolytic fragment of the 127 kD 
protein. Both the 61 kD and the 127 kD isoforms were identified in human synovial 
fluids, and a 1 00 kD protein was identified in human serum. Using the antipeptide 
antibody on immunoblots of tissue extracts, NTPPH expression was found only in 
articular tissues, e.g., hyaline cartilage, fibrocartilage, tendon, and ligament, in which 

25 CPPD deposition occurs. (Cardenal, A. et al. (1 996) Arthritis Rheum. 39:252-256; 
Cardenal, A. et al. (1996) Arthritis Rheum. 39:245-251.) Recently, a partial porcine 
NTPPH cDNA was isolated. (Masuda, I. et al. (1997) Gene 197:227-282.) 

A full length human nucleotide pyrophosphohydrolase (NTPPH-1 ) has been cloned 
using a cDNA clone isolated from a cartilagie cDNA library. Northern analysis of human, 

30 

dog, and rabbit joint tissue RNA samples indicated elevated levels of NTPPH-1 expression 
in articular cartilage, and lower, but significant, levels of expression in synovium, 
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meniscus, tendon and ligament. Expression studies on additional human tissues 
demonstrated significant mRNA levels in skeletal muscle, heart muscle, and bone marrow; 
and lower, but detectable, levels in trachea, spinal cord, thyroid, stomach, testis, uterus, 
small intestine, colon, thymus, placenta, lymph, and adrenal tissue. 
5 The discovery of a new human nucleotide pyrophosphohydrolase, NTPPH-2, and 

the polynucleotides encoding it satisfies a need in the art by providing new compositions 
which are useful in the diagnosis, prevention and treatment of arthropathies, 
immunological disorders, and cancers. 

10 SUMMARY OF THE INVENTION 

The invention features a substantially purified polypeptide, human nucleotide 
pyrophosphohydrolase-2 (NTPPH-2), comprising the amino acid sequence of SEQ ID 
NO: 1 or a fragment of SEQ ID NO: 1 . 

15 The invention further provides a substantially purified variant of NTPPH-2 having 

at least 90% amino acid identity to the amino acid sequence of SEQ ID NO:l or a 
fragment of SEQ ID NO: 1 . The invention also provides an isolated and purified 
polynucleotide encoding the polypeptide comprising the amino acid of SEQ ID NO: l or a 
fragment of SEQ ID NO: 1 . The invention also includes an isolated and purified 

20 polynucleotide variant having at least 90% polynucleotide identity to the polynucleotide 
encoding the polypeptide comprising the amino acid sequence of SEQ ID NO:l or a 
fragment of SEQ ID NO : 1 . 

Additionally, the invention provides a composition comprising a polynucleotide 
encoding the polypeptide comprising the amino acid sequence of SEQ IDNO:l or a 

25 fragment of SEQ ID NO: 1 . The invention further provides an isolated and purified 
polynucleotide which hybridizes under stringent conditions to the polynucleotide . 
encoding the polypeptide comprising the amino acid sequence of SEQ ID NO: l or a 
fragment of SEQ ID NO:l, as Well as an isolated and purified polynucleotide which is 
complementary to the polynucleotide encoding the polypeptide comprising the amino acid 

30; sequence of SEQ ID NO: 1 or a fragment of SEQ m NO: 1. 

The invention also provides an isolated and purified polynucleotide comprising 
SEQ ID NO:2 or a fragment of SEQ ID NO:2, and an isolated and purified polynucleotide 
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variant having at least 90% polynucleotide identity to the polynucleotide comprising SEQ 
ID NO:2 or a fragment of SEQ ID NO:2. Useful fragments of SEQ ID NO:2 may be 
selected from the group consisting of nucleotides 55 through 75, 481 through 507, 646 
through 669, 21 82 through 4149, 1726 through 4149, 757 through 4149, and 1 13 through 
5 4149. The invention additionally provides an isolated and purified polynucleotide which 

has been deposited as Accession No. in the American Type Culture Collection. The 

invention also provides an isolated and purified polynucleotide which is complementary to 
the polynucleotide comprising SEQ ID NO:2 or a fragment of SEQ ID NO:2. 

The invention further provides an expression vector containing at least a fragment 

10 of the polynucleotide encoding the polypeptide comprising the amino acid sequence of 
SEQ ID NO: 1 or a fragment of SEQ ID NO: 1 . In another aspect, the expression vector is 
contained within a host cell. 

The invention also provides a method for producing a polypeptide comprising the 
amino acid sequence of SEQ ID NO:l or a fragment of SEQ ID NO:l, the method 

15 comprising the steps of: (a) culturing the host cell containing an expression vector 
containing at least a fragment of a polynucleotide encoding NTPPH-2 under conditions 
suitable for the expression of the polypeptide; and (b) recovering the polypeptide from the 
host cell culture. 

The invention also provides a pharmaceutical composition comprising a 
20 substantially purified NTPPH-2 having the amino acid sequence of SEQ ID NO: 1 or a. 
fragment of SEQ ID NO: 1 and a suitable pharmaceutical carrier. 

The invention further includes a purified antibody which binds to a polypeptide 
comprising the amino acid sequence of SEQ ID NO: 1 or a fragment of SEQ ID NO: 1 , as 
well as a purified agonist and a purified antagonist of the polypeptide. 
25 The invention also provides a method for detecting a polynucleotide encoding 

NTPPH-2 in a biological sample containing nucleic acids, the method comprising the steps 
of: (a) hybridizing the complement of the polynucleotide encoding the polypeptide 

comprising SEQ ID NO:l or a fragment of SEQ IDNO:l to at least one of the nucleic 
30 acids of the biological sample, thereby forming a hybridization complex; and (b) detecting 
the hybridization complex, wherein the presence of the hybridization complex correlates 
with the presence of a polynucleotide encoding NTPPH-2 in the biological sample. In one 
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aspect, the nucleic acids of the biologicai sample ate amplified by the polymerase chain 
reaction prior to the hybridizing step. 

The invention also provides a method for treating or preventing an arthropathy, 
comprising administering to a subject in need of such treatment an effective amount of an 
5 antagonist of NTPPH-2. 

The invention also provides a method for treating or preventing an immunological 
disorder comprising adniinistering to a subject in need of such treatment an effective 
.amount of an antagonist of NTPPH-2. 

The invention also provides a method for treating or preventing a cancer 
10 compri sing administering to a subject in need of such treatment an efifective amount of an 
antagonist of NTPPH-2. 

The invention also provides a method for detecting a polynucleotide encoding 
NTPPH-2 in a biological sample containing nucleic acids, the method comprising the steps 
of: (a) hybridizing the complement of the polynucleotide encoding the polypeptide 
comprising SEQ ID NO:l orafragment of SEQ IDNO:l to at least one of the nucleic 
acids of the biological sample, thereby forming a hybridization complex; and (b) detecting 
the hybridization complex, wherein the presence of the hybridization complex correlates 
with the presence of a polynucleotide encoding NTPPH-2 in the biologicai sample. In one 
aspect, the nucleic acids of the biological sample are amplified by the polymerase chain . 
20 reaction prior to the hybridizing step. 

The invention also provides a method for detecting NTPPH-2 in a biological 
sample comprising the steps of: a) providing a biological sample; b) combining the 
biological sample and. an antibody which binds to a polypeptide comprising the amino acid 
sequence of SEQ ID NO:l or a fragment of SEQ ID NO:l under suitable conditions for 
25 complex formation to occur between NTPPH-2 and the antibody; and c) detecting 

complex formation between NTPPH-2 and the antibody, thereby establishing the presence 
of NTPPH-2 in the biological sample. 

The invention provides a method for screening a library of small molecules to 
identify a molecule, which binds NTPPH-2, the method comprising the steps of: a) 
providing a library of small molecules; b) combining the library of small molecules with . 
the polypeptide of SEQ ID NO:2 or a fragment of SEQ ID NO:2 under suitable conditions 
for complex formation; and c) detecting complex formation wherein the presence of the. 
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complex identifies a small molecule which binds NTPPH-2. 

The invention also provides a method for identifying an agonist, the methbd 
comprising the steps of: a) delivering one of the small molecules identified by screening 
a library of small molecules and gamma labeled ATP into cells transformed with a vector 
5 expressing NTPPH-2; b) growing the cells under suitable conditions; and c) assaying for 
an increased amount of PPi thereby establishing that the small molecule is an agonist 

The invention further provides a method for identifying an antagonist, the method 
comprising the steps of: a) delivering one of the small molecules identified by screening 
a library of small molecules and gamma labeled ATP into cells transformed with a vector 
10 expressing NTPPH-2; b) growing the cells under suitable conditions; and c) assaying for 
a decreased amount of PPi thereby establishing that the small moiecule is an antagonist. 

BRIEF DESCRIPTION OF THE FIGURES 
Figures 1A, IB, 1C, iD, IE, IF, 1G, 1H , 11, 1J, and IK show the amino acid 
15 sequence (SEQ ID NO:l) and nucleic acid sequence (SEQ ID NO:2) of human nucleotide 
pyrophosphohydrolase, NTPPH-2. The alignment was produced using MacDNAsis 
PRO™ software (Hitachi Software Engineering Co. Ltd. San Bruno, CA). 

Figures 2A, 2B, and 2C show sequence alignments between NTPPH-2 (SEQ ID 
NO:l) andNTPPH-1 (SEQIDNO:3). 
20 Figures 3 A and 3B show hydrophobicity plots for NTPPH-2 (SEQ ID NO: 1) and 

NTPPH-1 (SEQ ID NO:3); the positive X axis reflects amino acid position, and the 
negative Y axis, hydrophobicity (MacDNAsis PRO software). 
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DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is 
understood that this invention is not limited to the particular methodology, protocols, cell 
lines, vectors, and reagents described, as these may vary. It is also to be understood that 
5 the terminology used herein is for the purpose of describing particular embodiments only, 
and is not intended to limit the scope of the present invention which will be limited only 
by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms 
"a," ' 'an," and "the" include plural reference unless the context clearly dictates otherwise. 
10 Thus, e.g., a reference to "a host cell" includes a plurality of such host cells, and a 

reference to "an antibody" is a reference to one or more antibodies and equivalents thereof 
known to those skilled in the art, and so forth. 

Unless defined otherwise, all technical and scientific terms used herein have the 
same meanings as commonly understood by one of ordinary skill in the art to which this 
15 invention belongs. Although any methods and materials similar or equivalent to those 
described herein can be used in the practice or testing of the present invention, the 
preferred methods, devices, and materials are now described. All publications mentioned 
herein are cited for the purpose of describing and disclosing the cell lines, vectors, and 
methodologies which are reported in the publications and which might be used in 
20 connection with the invention. Nothing herein is to be construed as an admission that the 
invention is not entitled to antedate such disclosure by virtue of prior invention. 

DEFINITIONS 

"NTPPH-2," as used herein, refers to the amino acid sequences of substantially 
25 purified NTPPH-2 obtained from any species, particularly a mammalian species, including 
bovine, ovine, porcine, murine, equine, and preferably the human species, from any : 
source, whether natural, synthetic, semi-synthetic, or recombinant. 

The term "agonist," as used herein, refers to a molecule which, when bound to 
NTPPH-2, increases or prolongs the duration of the effect of NTPPH-2. Agonists may 
30 include proteins, nucleic acids, carbohydrates, or any other molecules which bind to aiid 
modulate the effect of NTPPH-2. 

An "allele" or an "allelic sequence," as these terms are used herein, is an 
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alternative form of the gene encoding NTPPH-2. Alleles may result from at least one 
mutation in the nucleic acid sequence and may result in altered mRNAs or in polypeptides 
whose structure or function may or may not be altered. Any given natural or recombinant 
gene may have none, one, or many allelic forms. Common mutational changes which give 
5 rise to alleles are generally ascribed to natural deletions* additions, or substitutions of 
nucleotides. Each of these types of changes may occur alone, or in combination with the 
Others, one or more times in a given sequence. 

"Altered" nucleic acid sequences encoding NTPPH-2, as described herein, include 
those sequences with deletions, insertions, or substitutions of different nucleotides, 

10 resulting in a polynucleotide the same as NTPPH-2 or a polypeptide with at least one 
functional characteristic of NTPPH-2. Included within this definition are polymoiphisms 
which may or may not be readily detectable using a particular oligonucleotide probe of the 
polynucleotide encoding NTPPH-2, and improper or unexpected hybridization to alleles, 
with a locus other than the normal chromosomal locus for the polynucleotide sequence 

15 encoding NTPPH-2. The encoded protein may also be "altered,- 1 arid may contain 

deletions, insertions, or substitutions of amino acid residues which produce a silent change 
and result in a functionally equivalent NTPPH-2. Deliberate amino acid substitutions may 
be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, 
hydrophilicity, and/or the amphipathic nature of the residues, as long as the biological or 

20 immunological activity of NTPPH-2 is retained. For example, negatively charged amino 
acids may include aspartic acid and glutamic acid, positively charged amino acids may 
include lysine and arginine, and amino acids with uncharged polar head groups having 
similar hydrophilicity values may include leucine, isoleucine, and valine; glycine and 
alanine; asparagine and glutamine; serine and threonine; and phenylalanine and tyrosine. 

25 The terms "amino acid" or "amino acid sequence," as used herein, refer to an 

oligopeptide, peptide, polypeptide, or protein sequence, or a fragment of any of these, and 
to naturally occurring or synthetic molecules. In this context, "fragments" refers to 
fragments of NTPPH-2 which are preferably about 5 to about 1 5 amino acids in length and 
which retain some biological activity or immunological activity of NTPPH-2. Where 

30 "amino acid sequence" is recited herein to refer to an amino acid sequence of a naturally 
occurring protein molecule, "amino acid sequence" and like terms are not meant to limit 
the amino acid sequence to the complete native amino acid sequence associated with the 

-8- 
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recited protein molecule. 

"Amplification," as used herein, relates to the production of additional copies of a 
nucleic acid sequence. Amplification is generally carried out using polymerase chain 
reaction (PCR) technologies well known in the art (Dieffenbach, C.W. and G.S: Dveksler 
5 0 995) PCR Primer, a laboratory Manual, Cold Spring Harbor Press, Plainview, NY.) 

The term "antagonist," as it is used herein, refers to a molecule which, when bound 
to NTPPH-2, decreases the amount or the duration of the effect of the biological or 
immunological activity of NTPPH-2. Antagonists may include proteins, nucleic acids, 
carbohydrates, antibodies, or any other molecules which decrease the effect of NTPPH-2. 
10 As used herein, the term "antibody" refers to intact molecules as well as to 

fragments thereof, such as Fa, F^b'^, and Fv fragments, which are capable of binding the 
epitopic detenninant Antibodies that bind NTPPH-2 polypeptides can be prepared using 
intact polypeptides or using fragments containing small peptides pf interest as the 
immunizing antigen. The polypeptide or oligopeptide used to immunize an animal (eig., a 
15 mouse, a rat, or a rabbit) can be derived from the translation of RNA, of synthesized 

chemically, and can be conjugated to a carrier protein if desired. Commonly used carriers 
that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, and 
keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize the 
animal. 

20 The term "antigenic determinant," as used herein, refers to that fragment of a 

molecule (i.e., an epitope) that makes contact with a particular antibody. When a protein 
or a fragment of a protein is used to immunize a host animal, numerous regions of the 
protein may induce the production of antibodies which bind specifically to antigenic 
determinants (given regions or three-dimensional structures on the protein). An antigenic 
25 determinant may compete with the intact antigen (i.e., the immunogen used to elicit the 
immune response) for binding to an antibody. 

The term "antisense," as used herein, refers to any composition containing a 
nucleic acid sequence which is complementary to a specific DNA or RNA sequence. The 
term "antisense strand" is used in reference to a nucleic acid strand that is complementary 
30 to the "sense" strand. Antisense molecules may be produced by any method including 
synthesis or transcription. Once introduced into a cell, the complementary nucleotides 
combine with natural sequences produced by the cell to form duplexes and to block either 
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transcription or translation. The designation "negative" can refer to the antisense strand, 
and the designation "positive" can refer to the sense strand. 1 

As used herein, the term "biologically active," refers to a protein having structural, 
regulatory, or biochemical functions of a naturally occurring molecule. Likewise, 
5 "immunologically active" refers to the capability of the natural, recombinant, or synthetic 
NTPPH-2, or of any oligopeptide thereof, to induce a specific immune response in 
appropriate animals or cells and to bind with specific antibodies. 

The terms "complementary" or "complementarity," as used herein, refer to the 
natural binding of polynucleotides under permissive salt and temperature conditions by 

10 base pairing. For example, the sequence "A-G-T" binds to the complementary sequence 
"T-C-A." Complementarity between two single-stranded molecules may be "partial," 
such that only some of the nucleic acids bind, or it may be "complete," such that total 
complementarity exists between the single stranded molecules! The degree of 
complementarity between nucleic acid strands has significant effects on the efficiency and 

15 strength of the hybridization between the nucleic acid strands. This is of particular 

importance in amplification reactions, which depend upon binding between nucleic acids 
strands, and in the design and use of peptide nucleic acid (PNA) molecules. 

A "composition comprising a given polynucleotide sequence" or a "composition 
comprising a given amino acid sequence," as these terms are used herein, refer broadly to 

20 any composition containing the given polynucleotide or amino acid sequence. The 
composition may comprise a dry formulation or an aqueous solution. Compositions 
comprising polynucleotide sequences encoding NTPPH-2 or fragments of NTPPH-2 may 
be employed as hybridization probes. The probes may be stored in freeze-dried form and 
may be associated with a stabilizing agent such as a carbohydrate. In hybridizations, the 

25 probe may be deployed in an aqueous solution containing salts (e.g., NaCl), detergents 
(e.g., SDS), and other components (e.g., Denhardt's solution, dry milk, salmon sperm 
DNA, etc.). 

The phrase "consensus sequence," as used herein, refers to a nucleic acid sequence 
which has been resequenced to resolve uncalled bases, extended using XL-PCR™ (Perkin 
30 Elmer, Norwalk, CT) in the 5' and/or the 3' direction, and resequenced, or which has been 
assembled from the overlapping sequences of more than one Incyte Clone using a 
computer program for fragment assembly, such as the GELVIEW™ Fragriient Assembly 
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? 

system (GCG, Madison, WI). Some sequences have been both extended and assembled to 
produce the consensus sequence . 

As used herein, the term "correlates with expression of a polynucleotide" indicates 
that the detection of the presence of nucleic acids, the same or related to a nucleic acid 
5 sequence encoding NTPPH-2, by northern analysis is indicative of the presence of nucleic 
acids encoding NTPPH-2 in a sample, and thereby correlates with expression of the 
transcript from the polynucleotide encoding NTPPH-2. 

A "deletion," as the term is used herein, refers to a change in the amino acid or 
nucleotide sequence that results in the absence of one or more amino acid residues or 
10 nucleotides. 

The term "derivative," as used herein, refers to the chemical modification of 
NTPPH-2, of a polynucleotide sequence encoding NTPPH-2, or of a polynucleotide 
sequence complementary to a polynucleotide sequence encoding NTPPH-2. Chemical 
modifications of a polynucleotide sequence can include, e.g., replacement of hydrogen by 
15 an alkyl, acyl, or amino group. A derivative polynucleotide encodes a polypeptide which 
retains at least one biological or immunological function of the natural molecule. A 
derivative polypeptide is one modified by glycosylation, pegylation, or any similar process 
that retains at least one biological or immunological function of the polypeptide from 
which it was derived. 

20 The term "homology," as used herein, refers to a degree of complementarity. 

There may be partial homology or complete homology. The word "identity" may 
substitute for the word "homology." A partially complementary sequence that at least 
partially inhibits an identical sequence from hybridizing to a target nucleic acid is referred 
to as "substantially homologous." The inhibition of hybridization of the completely 

25 complementary sequence to the target sequence may be examined using a hybridization 
assay (Southern or northern blot, solution hybridization, and the like) under conditions of 
reduced stringency. A substantially homologous sequence or hybridization probe will 
compete for and inhibit the binding of a completely homologous sequence to the target 
sequence under conditions of reduced stringency. This is not to say that conditions of 

30 reduced stringency are such that non-specific binding is permitted, as reduced stringency 
conditions require that the binding of two sequences to one another be a specific (i.e., a 
selective) interaction. The absence of non-specific binding may be tested by the use of a 
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second target sequence which lacks even a partial degree of complementarity (e.g., less 
than about 30% homology or identity). In the absence of non-specific binding, thi 
substantially homologous sequence or probe will not hybridize to the second non- 
complementary target sequence. 
5 "Human artificial chromosomes" (HACs), as described herein, are linear 

microchromosomes which may contain DNA sequences of about 10 kb to 10 mb in size, 
and which contain all of the elements required for stable mitotic chromosome segregation 
and maintenance. (Harrington, J.J. et al. (1 997) Nat Genet. 1 5:345-355.) 

The term "humanized antibody," as used herein, refers to antibody molecules in 
10 which the amino acid sequence in the non-antigen binding regions has been altered so that 
the antibody more closely resembles a human antibody, and still retains its original 
binding ability. 

"Hybridization," as the term is used herein, refers to any process by which a strand 
of nucleic acid binds with a complementary strand through base pairing. 

15 As used herein, the term "hybridization complex" as used herein, refers to a 

complex formed between two nucleic acid sequences by virtue of the formation of 
hydrogen bonds between complementary bases. A hybridization complex may be formed 
in solution (e.g., C 0 t or I^t analysis) or formed between one nucleic acid sequence present 
in solution and another nucleic acid sequence immobilized on a solid support (e.g., paper, 

20 membranes, filters, chips, pins or glass slides, or any other appropriate substrate to which 
cells or their nucleic acids have been fixed). 

"Inflammation" as used herein is interchangeable with "immune response", with 
both terms referring to a condition associated with trauma, immune disorders, and 
infectious or genetic diseases and are characterized by production of cytokines, 

25 chemokines, and other signaling molecules which activate cellular and systemic defense 
systems. 

The words "insertion" or "addition," as used herein, refer to changes in an amino 
acid or nucleotide sequence resulting in the addition of one or more amino acid residues or 
nucleotides, respectively, to the sequence found in the naturally occurring molecule. 
30 The term "microarray," as used herein, refers to an array of distinct polynucleotides 

or oligonucleotides arrayed on a substrate, such as paper, nylon or any other type of 
membrane, filter, chip, glass slide, or any other suitable solid support. 
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The term "modulate," as it appears herein, refers to a change in the activity of 
NTPPH-2. For example, modulation may cause an increase or a decrease in protein 
activity, binding characteristics, or any other biological, functional, or immunological 
properties of NTPPH-2. 
5 The phrases "nucleic acid" or "nucleic acid sequence," as used herein, refer to an 

oligonucleotide, nucleotide, polynucleotide, or any fragment thereof, to DNA or RNA of 
genomic or synthetic origin which may be single-stranded of double-stranded and may 
represent the sense or the antisense strand, to peptide nucleic acid (PNA), or to any DNA- 
like or RNA-like material. In thiscontext, "fragments" refers to those nucleic acid 
10 sequences which are greater than about 60 nucleotides in length, and most preferably are at 
least about 1 00 nucleotides, at least about 1 000 nucleotides, or at least about 1 0,000 
nucleotides in length. 

The term "oligonucleotide," as used herein, refers to a nucleic acid sequence of at 
least about 6 nucleotides to 60 nucleotides, preferably about 15 to 30 nucleotides, and 
1 5 most preferably about 20 to 25 nucleotides, which can be used in PCR amplification or in 
a hybridization assay or microarray. As used herein, the term "oligonucleotide" is 
substantially equivalent to the terms "amplimers,""primers," "oligomers," and "probes," as 
these terms are commonly defined in the art. 

"Peptide nucleic acid" (PNA), as used herein, refers to an antisense molecule or 
20 anti-gene agent which comprises an oligonucleotide of at least about 5 nucleotides in 
length linked to a peptide backbone of amino acid residues ending in lysine. The terminal 
lysine confers solubility to the composition. PNAs preferentially bind complementary 
single stranded DNA and RNA and stop transcript elongation, and may be pegylated to 
extend their lifespan in the cell. (Nielsen, P.E. et al. (1993) Anticancer Drug Des. 8:53-63. 
25 The term "sample," as used herein, is used in its broadest sense. A biological 

sample suspected of containing nucleic acids encoding NTPPH-2, or fragments thereof, or 

NTPPH-2 itself may comprise a bodily fluid; an extract from a cell, chromosome, 
organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or cDNA (in 
solution or bound to a solid support); a tissue; a tissue print; and the like. 
30 As used herein, the terms "specific binding" or "specifically binding" refer to that 

interaction between a protein or peptide and an agonist, an antibody, or an antagonist: The 
interaction is dependent upon the presence of a particular structure of the protein 
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recognized by the binding molecule (i.e., the antigenic determinant or epitope). For 
example, if an antibody is specific for epitope M A," the presence of a polypeptide ' 
containing the epitope A, or the presence of free unlabeled A, in a reaction containing free 
labeled A and the antibody will reduce the amount of labeled A that binds to the antibody. 
5 As used herein, the term "stringent conditions" refers to conditions which permit 

hybridization between polynucleotide sequences and the claimed polynucleotide 
sequences. Suitably stringent conditions can be defined by, e!g., the concentrations of salt 
or formamide in the prehybridization and hybridization solutions, or by the hybridization 
temperature, and are well known in the art. In particular, stringency can be increased by 

10 reducing the concentration of salt, increasing the concentration of formamide, or raising 
the hybridization temperature. 

For example, hybridization under high stringency conditions could occur in about 
50% formamide at about 37°C to 42°C. Hybridization could occur under reduced 
stringency conditions in about 35% to 25% formamide at about 30°C to 35°C. In 

15 particular, hybridization could occur under high stringency conditions at 42°C in 50% 
formamide, 5X SSPE, 0.3% SDS, and 200 /ig/ml sheared and denatured salmon sperm 
DNA. Hybridization could occur under reduced stringency conditions as described above, 
but in 35% formamide at a reduced temperature of 35°C. The temperature range 
corresponding to a particular level of stringency can be further narrowed by calculating the 

20 purine to pyrimidine ratio of the nucleic acid of interest and adjusting the temperature 
accordingly. Variations on the above ranges and conditions are well known in the art. 

The term "substantially purified," as used herein, refers to nucleic acid or amino 
acid sequences that are removed from their natural environment and are isolated or 
separated, and are at least about 60% free, preferably about 75% free, and most preferably 

25 about 90% free from other components with which they are naturally associated. 

A "substitution," as used herein, refers to the replacement of one or more amino 
acids or nucleotides by different amino acids or nucleotides, respectively 

"Transformation," as defined herein, describes a process by which exogenous DNA 
enters and changes a recipient cell. Transformation may occur under natural or artificial 

30 conditions according to various methods well known in the art, and may rely on any 
known method for the insertion of foreign nucleic acid sequences into a prokaryotic or 
eukaryotic host cell. The method for transformation is selected based on the type of host 
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cell being transformed and may include, but is not limited to, viral infection, 
electroporation, heat shock, lipofection, and particle bombardment. The term I 
"transformed" cells includes stably transformed cells in which the inserted DNA is capable , 
of replication either as aii autonomously replicating plasmid or as part of the host 
5 chromosome, and refers to cells which transiently express the inserted DNA or RNA for 
limited periods of time. 

A "variant" of NTPPH^2, as used herein, refers to an amino acid sequence that is 
altered by one or more amino acids. The variant may have "conservative" changes, 
wherein a substituted amino acid has similar structural or chemical properties (e.g., 

1 0 replacement of leucine with is oleucirie). More rarely, a variant may have 

"nonconservative" changes (e.g., replacement of glycine with tryptophan). Analogous 
minor variations may also include amino acid deletions or insertions, or both. Guidance in 
determining which amino acid residues may be substituted, inserted, or deleted without 
abolishing biological or immunological activity may be found using computer programs 

15 well known in the art, e.g., DNASTAR software. 

THE INVENTION 

The invention is based on the discovery of a new human nucleotide 
pyrophosphohydrolase (NTPPH-2), the polynucleotides encoding NTPPH-2, and the use 

20 of these compositions for the diagnosis, treatment, or prevention of arthropathies, 
immunological disorders, and cancers. 

Nucleic acids encoding the NTPPH-2 of the present invention were identified in 
Incyte Clones 1388013, 1423393, and 1423402 from the osteoarthritic chondrocyte cDNA 
library (SATPF 1008) using a computer search for nucleotide sequence homology arid the 

25 partial porcine cDNA sequence. (Masuda, supra .) A 4.1 kb sequence was identified in the 
chondrocyte library using the cDNA insert from Incyte Clone 1423393 as a hybridization 
probe. When a 700 bp restriction fragment from the 5' most coding region of 4.1 kb clone 
was used to rescreen the osteoarthritic chondrocyte library, the full length gene encoding 
NTPPH-2 with appropriate Kozak initiation and signal sequence was obtained. This 

30 sequence does not match any sequence in the public DNA sequence database. The cDNA 
for full length NTPPH-2 of this application has been deposited as Accession 
Number at the American Type Culture Collection, Bethesda, MD on fdateV 
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In one embodiment, the invention encompasses a polypeptide comprising the 
amino acid sequence of SEQ ID NO:l. NTPPH-2 is 1 156 amino acids in length (Figures 
1A-1K) and has three potential N-glycosylation sites at N„ 6 , N^g, N 329 , 25 potential 
phosphorylation sites at T 24 , S I35 , S^, T 245 , S^, S 325 , T 331 ,T 372 , S 427 , S 434 , S 439 , T 5l7 , T 523 , : 
5 Y 599 , T m , S^, T 75<h T M7 , S g83 , Y^, S 977 , S , 017> T l063 > S l06g> and Tj , 49 . As shown in Figures 
2A, 2B, and 2C, NTPPH-2 has chemical and structural homology with NTPPH-1 (SEQ ID 
NO:3). In particular, NTPPH-2 and NTPPH-1 share 50% sequence identity. Fragments of 
the nucleic acid sequence useful for designing oligonucleotides or to be used directly as 
hybridization probes to distinguish between these homologous molecules include the 

10 fragments from nucleotides 55 through 75, 481 through 507, 646 through 669, 21 82 
through 4149, 1726 through 4149, 757 through 4149, and 1 13 through 4149. As 
illustrated by Figures 3A and 3B, NTPPH-2 and NTPPH-1 have similar hydrophobicity 
plots and both show a hydrophobic signal sequence. The predicted isoelectric points for 
NTPPH-2 and NTPPH-1 are 8.07 and 8.21, respectively. Membrane-based northern 

15 analysis showed the highest level of NTPPH-2 mRNA expression in cartilage ahd lower, 
but significant, expression in testes, trachea, and bone marrow. Electronic northern 
analysis shows the expression of this sequence in various libraries at least 57% of which 
involve immunological response and many of which are cartilage or joint related and at 
least 26% of which involve immortalized or cancerous cells and tissues. Of particular note 

20 is the expression of NTPPH-2 in rheumatoid and osteoarthritic synovial, chondrocyte, and 
tibial libraries. 

The invention also encompasses NTPPH-2 variants. A preferred NTPPH-2 variant 
is one having at least about 80%, more preferably at least about 90%, and most preferably 
at least about 95% amino acid sequence identity to the NTPPH-2 amino acid sequence and 

25 which retains at least one biological, immunological or other functional characteristic or 
activity of NTPPH-2. 

The invention also encompasses polynucleotides which encode NTPPH-2. In a 
particular embodiment, the invention encompasses a polynucleotide sequence comprising 
the sequence of SEQ ID NO:2, which encodes an NTPPH-2. 

30 The invention also encompasses a variant of a polynucleotide sequence encoding 

NTPPH-2. In particular, such a variant polynucleotide sequence will have at least about 
80%, more preferably at least about 90%, and most preferably at least about 95% 
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polynucleotide sequence identity to the polynucleotide sequence encoding NTPPH-2. A 
particular aspect of the invention encompasses a variant of SEQ ID NO:2 which has at 
least about 80%, more preferably at least about 90%, and most preferably at least about 
95% polynucleotide sequence identity to SEQ ID NO:2. In addition, the amino acid 
sequences encoded by these variants may have at least one functional or structural 
characteristic of NTPPH-2. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of 
the genetic code, a multitude of polynucleotide sequences encoding NTPPH-2, some 
bearing minimal homology to the polynucleotide sequences of any known and naturally 
occurring gene, may be produced. Thus, the invention contemplates each and every 

possible variation of polynucleotide sequence that could be made by selecting 

combinations based on possible codon choices. These combinations are made in 
accordance with the standard triplet genetic code as applied to the polynucleotide sequence 
of naturally occurring NTPPH-2, and all such variations are to be considered as being 
15 specifically disclosed. 

Although nucleotide sequences which encode NTPPH-2 and its variants are 
preferably capable of hybridizing to the nucleotide sequence of the naturally occurring 
NTPPH-2 under appropriately selected conditions of stringency^ it may be advantageous to 
produce nucleotide sequences encoding NTPPH-2 or its derivatives possessing a 
20 substantially different codon usage. Codons may be selected to increase the rate at which 
expression of the peptide occurs in a particular prokaryotic or eukaryotic host in 
accordance with the frequency with which particular codons are utilized by the host. 
Other reasons for substantially altering the nucleotide sequence encoding NTPPH-2 and its 
derivatives without altering the encoded amino acid sequences include the production of 
25 RNA transcripts having more desirable properties, such as a greater half-life, than 
transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode 
NTPPH-2 and NTPPH-2 derivatives, or fragments thereof, entirely by synthetic chemistry. 
After production, the synthetic sequence may be inserted into any of the many available 
30 expression vectors and cell systems using reagents that are well known in the art. 
Moreover, synthetic chemistry may be used to introduce mutations into a sequence 
encoding NTPPH-2 or any fragment thereof. 
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Also encompassed by the invention are polynucleotide sequences that are capable 
of hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown 
in SEQ ID NO:2, or a fragment of SEQ ID NO:2, under various conditions of stringency 
as taught in Wahl, G.M. and S.L. Berger (1987; Methods En2ymol. 152:399-407) and • 
5 Kimmel, A.R. (1987; Methods Enzymol. 152:507-51 L) 

Methods for DNA sequencing are well known and generally available in the art 
and may be used to practice any of the embodiments of the invention. The methods may 
employ such enzymes as the Klenow fragment of DNA polymerase I, Sequenase® (US 
Biochemical Corp., Cleveland, OH), Taq polymerase (Perkin Elmer), thermostable T7 

10 polymerase (Amersham, Chicago, IL), or combinations of polymerases and proofreading 
exonucleases such as those found in the ELONGASE Amplification System marketed by 
GlBCO/BRL (Gaithersburg, MD). Preferably, the process is automated with machines 
such as the Hamilton Micro Lab 2200 (Hamilton, Reno, NV), Peltier Thermal Cycler 
(PTC200; MJ Research, Watertown, MA) and the ABI Catalyst and 373 and 377 DNA 

15 Sequencers (Perkin Elmer). 

The nucleic acid sequences encoding NTPPH-2 may be extended utilizing a partial 
nucleotide sequence and employing various methods known in the art to detect upstream 
sequences, such as promoters and regulatory elements. For example, one method which 
may be employed, restriction-site PCR, uses universal primers to retrieve unknown 

20 sequence adjacent to a known locus. (Sarkar, G. (1 993) PCR Methods Applic. 2:3 1 8-322.) 
In particular, genomic DNA is first amplified in the presence of a primer to a linker 
sequence and a primer specific to the known region. The amplified sequences are then 
subjected to a second round of PCR with the same linker primer and another specific 
primer internal to the first one. Products of each round of PCR are transcribed with an 

25 appropriate RNA polymerase and sequenced using reverse transcriptase. 

Inverse PCR may also be used to amplify or extend sequences using divergent 
primers based on a known region. (Triglia, T. et al. (1988) Nucleic Acids Res. 16:8186.) 
The primers may be designed using commercially available software such as OLIGO 4.06 
Primer Analysis software (National Biosciences Inc., Plymouth, MN) or another 

30 appropriate program to be about 22 to 30 nucleotides in length, to have a GC content pf 
about 50% or more, and to anneal to the target sequence at temperatures of about 68°C to 
72°C. The method uses several restriction enzymes to generate a suitable fragment in the 
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known region of a gene. The fragment is then circularized by intramolecular ligation and 
used as a PCR template. j 

Another method which may be used is capture PCR, which involves PCR 
amplification of DNA fragments adjacent to a known sequence in human and yeast 
5 artificial chromosome DNA. (Lagerstrom, M. et al. (1 991) PCR Methods Applic. 

1:111-119.) In this method, multiple restriction enzyme digestions and ligations may be 
used to place an engineered double-stranded sequence into an unknown fragment of the 
DNA molecule before performing PCR. Another method which may be used to retrieve 
unknown sequences is that of Parker, J.D. et al. (1991 ; Nucleic Acids Res. 19:3055-3060). 

1 6 Additionally, one may use PCR, nested primers, and PromoterFinder™ libraries to walk 
genomic DNAlClontech, Palo Alto, CA). This process avoids the need to screen libraries 
and is useful in finding intron/exon junctions. 

When screening for full-length cDNAs, it is preferable to use libraries that have 
been size-selected to include larger cDNAs. Also, random-primed libraries are preferable 

15 in that they will include more sequences which contain the 5' regions of genes. Use of a 
randomly primed library may be especially preferable for situations in which an oligo d(T) 
library does not yield a full-length cDNA. Genomic libraries may be useful for extension 
of sequence into 5' non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used 

20 to analyze the size or confirm the nucleotide sequence of sequencing or PCR products. In 
particular, capillary sequencing may employ flowable polymers for electrophoretic 
separation, four different fluorescent dyes (one for each nucleotide) which are laser 
activated, and a charge coupled device camera for detection of the emitted wavelengths. 
Output/light intensity may be converted to electrical signal using appropriate software. 

25 (e.g., Genotyper™ and Sequence Navigator™, Perkin Elmer), and the entire process from 
loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for the sequencing of small 
pieces of DNA which might be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments 

30 thereof which encode NTPPH-2 may be used in recombinant DNA molecules to direct 
expression of NTPPH-2, or fragments or functional equivalents thereof, in appropriate host 
cells. Due to the inherent degeneracy of the genetic code, other DNA sequences which 
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encode substantially the same or a functionally equivalent amino acid sequence may be 
produced, and these sequences may be used to clone and express NTPPH-2. 

As will be understood by those of skill in the art, it may be advantageous to 
produce NTPPH-2-encoding nucleotide sequences possessing non-naturally occurring 
5 codons. For example, codons preferred by a particular prokaryotic or eukaryotic host can 
be selected to increase the rate of protein expression or to produce an RNA transcript 
having desirable properties, such as a half-life which is longer than that of a transcript 
generated from the naturally occurring sequence. 

The nucleotide sequences of the present invention can be engineered using 

10 methods generally known in the art in order to alter NTPPH-2 encoding sequences for a 
variety of reasons including, but not limited to, alterations which modify the cloning, 
processing, and/or expression of the gene product. DNA shuffling by random 
fragmentation and PCR reassembly of gene fragments and synthetic oligonucleotides may 
be used to engineer the nucleotide sequences. For example, site-directed mutagenesis may 

15 be used to insert new restriction sites, alter glycosylation patterns, change codon' 
preference, produce splice variants, introduce mutations, and so forth. 

In another embodiment of the invention, natural, modified, or recombinant nucleic 
acid sequences encoding NTPPH-2 may be ligated to a heterologous sequence to encode a 
fusion protein. For example, to screen peptide libraries for inhibitors of NTPPH-2 

20 activity, it may be useful to encode a chimeric NTPPH-2 protein that can be recognized by 
a commercially available antibody. A fusion protein may also be engineered to contain a 
cleavage site located between the NTPPH-2 encoding sequence and the heterologous 
protein sequence, so that NTPPH-2 may be cleaved and purified away from the 
heterologous moiety. 

25 In another embodiment, sequences encoding NTPPH-2 may be synthesized, in 

whole or in part, using chemical methods well known in the art (Caruthers, M.H. et al. 
(1 980) Nucl. Acids Res. Symp. Ser. 21 5-223, and Horn, T. et al. (1 980) Nucl. Acids Res. 
Symp. Ser. 225-232). Alternatively, the protein itself may be produced using chemical 
methods to synthesize the amino acid sequence of NTPPH-2, or a fragment thereof. For 

30 example, peptide synthesis can be performed using various solid-phase techniques 
(Roberge, J. Y. et al. (1 995) Science 269:202-204) and automated synthesis may be 
achieved using the ABI 43 1 A Peptide Synthesizer (Perkin Elmer). 
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The newly synthesized peptide may be substantially purified by preparative high 
performance liquid chromatography. (Creighton, T. (1983) Proteins. Structures and . 
Molecular Principles, WH Freeman and Co., New York, NY.) The composition of the 
synthetic peptides may be confirmed by amino acid analysis or by sequencing, (the 
5 Edman degradation procedure described in Creighton, supra .) Additionally, the amino 
acid sequence of NTPPH-2, or any part thereof, may be altered during direct synthesis 
and/or combined with sequences from other proteins, or any part thereof, to produce a 
variant polypeptide. 

In order to express a biologically active NTPPH-2, the nucleotide sequences 

1 0 encoding NTPPH-2 or derivatives thereof may be inserted into appropriate expression 
vector, i.e., a vector which contains the necessary elements for the transcription and 
translation of the inserted coding sequence. 

Methods which are well known to those skilled in the art may be used to construct 
expression vectors containing sequences encoding NTPPH-2 and appropriate 

15 transcriptional and translational control elements. These methods include uvyjfio 

recombinant DNA techniques, synthetic techniques, and in yjva genetic recombination. 
Such techniques are described in Sambrook, J. et al. (1 989; Molecular Cloning . A 
Laboratory Manual, Cold Spring Harbor Press, Plainview, NY) and Ausubel, F.M. et al. 
(1989; Current Protocols in Molecular Biolnpv, John Wiley & Sons, New York, NY.) 

20 A variety of expression vector/host systems may be utilized to contain and express 

sequences encoding NTPPH-2. These include, but are not limited to, microorganisms 
such as bacteria transformed with recombinant bacteriophage, plasniid, or cosmid DNA 
expression vectors; yeast transformed with yeast expression vectors; insect cell systems 
infected with virus expression vectors (e.g., baculovirus); plant cell systems transformed 

25 with virus expression vectors (e.g., cauliflower mosaic virus (CaMV) or tobacco mosaic 
virus (TMV)) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 
animal cell systems; . 

The invention is not limited by the host cell employed. 

The "control elements" or "regulatory sequences" are those non-translated regions 
30 of the vector (i.e., enhancers, promoters, and 5' and 3' untranslated regions) which interact 
with host cellular proteins to cany out transcription and translation. Such elements may 
vary in their strength and specificity. Depending on the vector system and host utilized, 
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any number of suitable transcription and translation elements, including constitutive and 
inducible promoters, may be used. For example, when cloning in bacterial systems, 
inducible promoters such as the hybrid lacZ promoter of the Bluescript® phagemid 
(Stratagene, La Jolla, CA) or pSportl™ plasmid (GlBCO/BRL), and the like, may be 
5 used. The baculovirus polyhedrin promoter may be used in insect cells. Promoters or 
enhancers derived from the genomes of plant cells (e.g., heat shock, RUBISCO, and 
storage protein genes) or from plant viruses (e.g., viral promoters or leader sequences) may 
be cloned into the vector. In mammalian cell systems, promoters from mammalian genes 
or from mammalian viruses are preferable. If it is necessary to generate a cell line that 

1 0 contains multiple copies of the sequence encoding NTPPH-2, vectors based on S V40 or 
EB V may be used with an appropriate selectable marker. 

In bacterial systems, a number of expression vectors may be selected depending 
upon the use intended for NTPPH-2. For example, when large quantities of NTPPH-2 are 
needed for the induction of antibodies, vectors which direct high level expression of fusion 

1 5 proteins that are readily purified may be used. Such vectors include, but are not limited to, 
multifunctional £. £oli cloning and expression vectors such as Bluescript® (Stratagene), in 
which the sequence encoding NTPPH-2 may be ligated into the vector in frame with 
sequences for the amino-terminal Met and the subsequent 7 residues of B-galactosidase so 
that a hybrid protein is produced, pIN vectors (Van Heeke, G. and S.M. Schuster (1 989) J. 

20 Biol. Chem. 264:5503-5509), and the like. pGEX vectors (Promega, Madison, WI) may 
also be used to express foreign polypeptides as fusion proteins with glutathione 
S-transferase (GST). In general, such fusion proteins are soluble and can easily be purified 
from lysed cells by adsorption to glutathione-agarose beads followed by elution in the 
presence of free glutathione. Proteins made in such sy stems may be designed to include 

25 heparin, thrombin, or factor XA protease cleavage sites so that the cloned polypeptide of 
interest can be released from the GST moiety at will. 

In the yeast Saccharomvces cerevisiae , a number of vectors containing constitutive 
or inducible promoters, such as alpha factor, alcohol oxidase, and PGH, may be used. For 
reviews, see Ausubel {supra ) and Grant et al. (1987; Methods Enzymol. 153:516-544.) 

30 In cases where plant expression vectors are used, the expression of sequences 

encoding NTPPH-2 may be driven by any of a number of promoters. For example, viral 
promoters such as the 35S and 1 9S promoters of CaMV may be used alone or in 

-22- 



CID: <WO 993261 OA 1_l_> 



WO99/32610 PCT/US98/25558 

» 

combination with the omega leader sequence from TMV. (Takamatsu, N. (1987) EMBO 
J . 6:307-3 1 1 .) Alternatively, plant promoters such as the small subunit of RUBISCO or 
heat shock promoters may be used. (Coruzzi, G. etal. (1984) EMBO J. 3:1671-1680; 
Broglie, R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. 
5 Cell Differ. 17:85-1 05 .) These constructs can be introduced into plant cells by direct DNA 
transformation or pathogen-mediated transfection. Such techniques are described in a 
number of generally available reviews (Hobbs, S. or Muny, L.E. in McGrawHill 
Yearbook of Science and Technology (1992) McGraw Hill, New York, NY, pp. 191rl96). 

An insect system may also be used to express NTPPH-l For example, in one such 
1 0 system, AntQgrapha californica nuclear polyhidrosis virus (AcNPV) is used as a vector to 
express foreign genes in Spodoptera frufriperfo cells or in Trichoplusia larvae. The 
sequences encoding NTPPH-2 may be cloned into a non-essential region of the virus, such 
as the polyhedrin gene, and placed under control of the polyhedrin promoter. Successful 
insertion of NTPPH-2 will render the polyhedrin gene inactive and produce recombinant 
15 virus lacking coat protein. The recombinant viruses may then be used to infect, e.g., JL 
frugiperda cells or Trichoplusia larvae in which NTPPH-2 may be expressed. (Engelhard, 
E.K. et al. (1994) Proc. Nat, Acad. Sci. 91:3224-3227.) 

In mammalian host cells, a number of viral-based expression systems may be 
utilized. In cases where an adenovirus is used as an expression vector, sequences encoding 
20 NTPPH-2 may be ligated into an adenovirus transcription/translation complex consisting 
of the late promoter and tripartite leader sequence. Insertion in a non-essential El or E3 
region of the viral genome may be used to obtain a viable virus which is capable of 
expressing NTPPH-2 in infected host cells. (Logan, J. and T. Shenk (1984) Proc. Natl. 
Acad. Sci. 81:3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma 
25 virus (RSV) enhancer, may be used to increase expression in mammalian host cells. 

Human artificial chromosomes (HACs) may also be employed to deliver larger * 
fragments of DNA than can be contained and expressed in a plasmid. HACs of about 6 kb 
to 10 mb are constructed and delivered via conventional delivery methods (liposomes, 
polycationic amino polymers, or vesicles) for therapeutic purposes. 
30 Specific initiation signals may also be used to achieve more efficient translation of 

sequences encoding NTPPH-2. Such signals include the ATG initiation codon and 
adjacent sequences. In cases where sequences encoding NTPPH-2 and its initiation codon 
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and upstream sequences are inserted into the appropriate expression vector, no additional 
transcriptional or translations! control signals may be needed. However, in cases ^vhere 
only coding sequence, or a fragment thereof, is inserted, exogenous translational control 
signals including the ATG initiation codoh should be provided. Furthermore, the initiation 
5 codon should be in the correct reading frame to ensure translation of the entire insert. 
Exogenous translational elements and initiation codons may be of various origins, both 
natural and synthetic. The efficiency of expression may be enhanced by the inclusion of 
enhancers appropriate for the particular cell system used, such as those described in the 
literature. (Scharf, D. et al. (1994) Results Probl. Cell Differ. 20:125-162.) 

10 In addition, a host cell strain may be chosen for its ability to modulate expression 

of the inserted sequences or to process the expressed protein in the desired fashion. Such 
modifications of the polypeptide include, but are not limited to* acetylation, carboxylation, 
glycosylation, phosphorylation, lipidation, and acylation. Pqst-translational processing 
which cleaves a "prepro" form of the protein may also be used to facilitate correct 

15 insertion, folding, and/or function. Different host cells which have specific cellular 
machinery and characteristic mechanisms for post-translational activities (e.g., CHO, 
HeLa, MDCK, HEK293, and WI38), are available from the American Type Culture 
Collection (ATCC, Bethesda, MD) and may be chosen to ensure the correct modification 
and processing of the foreign protein. 

20 For long term, high yield production of recombinant proteins, stable expression is 

preferred. For example, cell lines capable of stably expressing NTPPH-2 can be 

i 

transformed using' expression vectors which may contain viral origins of replication and/or 
endogenous expression elements and a selectable marker gene on the same or on a separate 
vector. Following the introduction of the vector, cells may be allowed to grow for about 1 
25 to 2 days in enriched media before being switched to selective media. The purpose of the 
selectable marker is to confer resistance to selection, and its presence allows growth and 
recovery of cells which successfully express the introduced sequences. Resistant clones of 
stably transformed cells may be proliferated using tissue culture techniques appropriate to 
the cell type. 

30 Any number of selection systems may be used to recover transformed cell lines. 

These include, but are not limited to, the herpes simplex virus thymidine kinase genes 
(Wigler, M. et al. (1977) Cell 1 i:223-32) and adenine phosphoribosyltransferase genes 
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(Lowy, I. et al. (1980) Cell 22:817-23), which can be employed in tk or apr cells, 
respectively. Also, antimetabolite, antibiotic, or herbicide resistance can be used as the 
basis for selection. For example, dhfr confers resistance to methotrexate (Wigler, M. et al. 
( 1 980) Proc. Natl. Acad. ScL 77:3567-70); npt confers resistance to the aminoglycosides 
5 neomycin and G-418 (Colbere-Garapin, R et al. (1981) J. Mol. Biol. 150:1-14); and als or 
pat confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, respectively. 
(Muny, supra.) Additional selectable genes have been described, e.g., trpB, which allows 
cells to utilize indole in place of tryptophan, or hisD, which allows cells to utilize histinol 
inplaceofhistidine. (Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. 

1 0 85:8047-5 1 .) Recently, the use of visible markers has gained popularity with such 
markers as anthocyanins, B glucuronidase and its substrate GUS, and luciferase and its 
substrate luciferin. These markers can be used not only to identify trarisformants, but also 
to quantify the amount of transient or stable protein expression attributable to a specific 
vector system. (Rhodes, C.A. et al. (1995) Methods Mol. Biol. 55:121-131.) 

15 Although the presence/absence of marker gene expression suggests that the gene of 

interest is also present, the presence and expression of the gene may need to be confirmed. 
For example, if the sequence encoding NTPPH-2 is inserted within a marker gene 
sequence, transformed cells containing sequences encoding NTPPH-2 can be identified by 
the absence of marker gene function. Alternatively, a marker gene can be placed in 

20 tandem with a sequence encoding NTPPH-2 under the control of a single promoter. 
Expression of the marker gene in response to induction or selection usually indicates 
expression of the tandem gene as well. 

Alternatively, host cells which contain the nucleic acid sequence encoding 
NTPPH-2 and express NTPPH-2 may be identified by a variety of procedures known to 

25 those of skill in the art. These procedures include, but are not limited to, DNA-DNA or 
DNA-RNA hybridizations and protein bioassay or immunoassay techniques which include 
membrane, solution, or chip based technologies for the detection and/or quantification of 
nucleic acid or protein sequences. 

The presence of polynucleotide sequences encoding NTPPH-2 can be detected by 

30 DNA-DNA or DNA-RNA hybridization or amplification using probes or fragments or 
fragments of polynucleotides encoding NTPPH-2. Nucleic acid amplification based 
assays involve the use of oligonucleotides or oligomers based on the sequences encoding 
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NTPPH-2 to detect transfoimants containing DNA or RNA encoding NTPPH-2. 

A variety of protocols for detecting and measuring the expression of NTP^H-2, 
using either polyclonal or monoclonal antibodies specific for the protein, are known in the 
art. Examples of such techniques include enzyme-linked immunosorbent assays 
5 (ELISAs), radioimmunoassays (RIAs), and fluorescence activated cell sorting (FACS). A 
two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two 
non-interfering epitopes on NTPPH-2 is preferred, but a competitive binding assay may be 
employed. These and other assays are well described in the art, e.g., in Hampton, R. et al. 
(1 990; Serological Methods, a Laboratory Manual. APS Press, St Paul, MN) and in 

10 Maddox,D.E.etaL(1983; J.Exp.Med. 158:1211-1216.) 

A wide variety of labels and conjugation techniques are known by those skilled in 
the art and may be used in various nucleic acid and amino acid assays. Means for 
producing labeled hybridization or PCR probes for detecting sequences related to 
polynucleotides encoding NTPPH-2 include oligolabeling, nick translation, end-labeling, 

15 or PCR amplification using a labeled nucleotide. Alternatively, the sequences encoding 
NTPPH-2, or any fragments thereof, may be cloned into a vector for the production of an 
mRNA probe. Such vectors are known in the art, are commercially available, and may be 
used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase 
such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using 

20 a variety of commercially available kits, such as those provided by Pharmacia & Upjohn 
(Kalamazoo, MI), Promega (Madison, WI), and U.S. Biochemical Corp. (Cleveland, OH). 
Suitable reporter molecules or labels which may be used for ease of detection include 
radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents, as well as 
substrates, cofactors, inhibitors, magnetic particles, and the like. 

25 Host cells transformed with nucleotide sequences encoding NTPPH-2 may be 

cultured under conditions suitable for the expression and recovery of the protein from cell 
culture. The protein produced by a transformed cell may be secreted or contained 
intracellularly depending on the sequence and/or the vector used. As will be understood 
by those of skill in the art, expression vectors containing polynucleotides which encode 

30 NTPPH-2 may be designed to contain signal sequences which direct secretion of NTPPH- 

2 through a prokaryotic or eukaryotic cell membrane. Other constructions may be used to 
join sequences encoding NTPPH-2 to nucleotide sequences encoding a polypeptide 
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domain which will facilitate purification of soluble proteins. Such purification facilitating 
domains include, but are not limited to, metal chelating peptides such as 
histidine-tryptophan modules that allow purification on immobilized metals, protein A 
domains that allow purification on immobilized immunoglobulin, and the domain utilized 

5 in the FLAGS extension/affinity purification system (Immunex Corp,, Seattle, WA). The 
inclusion of cleavable linker sequences, such as those specific for Factor XA or 
enterokinase (Invitrogen, San Diego, C A), between the purification domain and the 
NTPPH-2 encoding sequence may be used to facilitate purification. One such expression 
vector provides for expression of a fusion protein containing NTPPH^2 and a nucleic acid 

10 encoding 6 histidine residues preceding a thipredoxin or an enterokinase cleavage site, 
-pj^ hi s tidine residues facilitate purification on immobilized metal ion affinity ~ — ' 
chromatography (IMAC; described in Porath, J. et al. (1 992) Prot. Exp. Purif. 3: 263-281), 
while the enterokinase cleavage site provides a means for purifying NTPPH-2 from the 
fusion protein. A discussion of vectors which contain fusion proteins is provided in Kroll, 

15 DJ. et al. (1993; DNA Cell Biol. 12:441-453.) 

Fragments of NTPPH-2 may be produced not only by recombinant production,, but 
. also by direct peptide synthesis using solid-phase techniques. (Merrifield J. (1 963) J. Am. 
Chem. Soc. 85:2149-2154.) Protein synthesis may be performed by manual techniques or 
by automation. Automated synthesis may be achieved, e.g., using the Applied Biosystems 

20 431 A Peptide Synthesizer (Perkin Elmer). Various fragments of NTPPH-2 may be 
synthesized separately and then combined to produce the full length molecule. 

THERAPEUTICS 

Chemical and structural homology exists between NTPPH-2 and human NTPPH-1 
25 (SEQ ID NO:3). Transcripts hybridizing to the NTPPH-2 cDNA were detected in 

cartilage, in testes, trachea, and bone marrow tissues. Electronic northern analysis showed 
expression of NTPPH-2 in tissues with an immunological association (57%; including 
synovial and cartilage tissues), and in cancerous tissues (26%). Therefore, NTPF i H-2 
appears to play a role in arthropathies, immunological disorders, and cancers. 
30 Therefore, in one embodiment, an antagonist of NTPPH-2 may be administered to 

a subject to prevent or treat an arthropathy. Arthropathies include, but are not limited to, 
. Behcet's syndrome, Charcot osteoarthropathy, CPPD disease, diabetic neuropathic 
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arthropathy, degenerative joint disease, fibromyalgias, hemachromatosis, hemophilic 
arthropathy, Jaccoud's type arthropathy, lupus erythematosus, mixed connective tissue 
disease, Muckle- Wells syndrome, osteoarthritis, progressive systemic sclerosis, 
pseudogout, psoriasis, Reiter's syndrome, rheumatoid arthritis, Sjogren's syndrome, and 
5 spondyloarthropathies. In one aspect, an antibody which specifically binds NTPPH-2 may 
be used directly as an antagonist or indirectly as a targeting or delivery mechanism for 
bringing a pharmaceutical agent to cells or tissue which express NTPPH-2. 

In another embodiment, a vector expressing the complement of the polynucleotide 
encoding NTPPH-2 may be administered to a subject to treat or prevent an arthropathy 

10 including, but not limited to, those described above. 

In another embodiment, an antagonist of NTPPH-2 may be administered to a 
subject to prevent or treat an immunological disorder. Immunological disorders include, 
but are not limited to, AIDS, Addison's. disease, adult respiratory distress syndrome, 
allergies, anemia, asthma, atherosclerosis, bronchitis, cholecystitis, Crohn ! s disease, 

15 ulcerative colitis, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, 
erythema nodosum, atrophic gastritis, glomerulonephritis, gout, Graves' disease, 
hypereosinophilia, irritable bowel syndrome, multiple sclerosis, myasthenia gravis, 
myocardial or pericardial inflammation, osteoporosis, pancreatitis, polymyositis, 
scleroderma, autoimmune thyroiditis and ulcerative colitis; complications of cancer, 

20 hemodialysis, and extracorporeal circulation; viral, bacterial, fungal, parasitic, protozoal, 
and helminthic infections; and trauma. Such disorders may be characterized by the 
production of cytokines and the multiplication of leukocytes, macrophages, and other cells 
which may cause tissue damage or the inappropriate proliferation of tissues in response to 
inflammatory mediators or the generation of granulomatous tissues. In one aspect, an 

25 antibody which specifically binds NTPPH-2 may be used directly as an antagonist or 

indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to cells 
or tissue which express NTPPH-2. 

In another embodiment, a vector expressing the complement of the polynucleotide 
encoding NTPPH-2 may be administered to a subject to treat or prevent an immunological 

30 disorder including, but not limited to, those described above. 

In another embodiment, an antagonist of NTPPH-2 may be administered to a 
subject to prevent or treat a cancer. Cancers include, but are not limited to, 

-28- 



3CID: <WO 9932610A1J_> 



WO99/32610 PCT/US98/25558 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, 
and, in particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, 
cartilage, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, 
muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, 
5 thymus, thyroid, and uterus. In one aspect, an antibody which specifically binds NTPPH-2 
may be used directly as an antagonist or indirectly as a targeting or delivery mechanism 
for bringing a pharmaceutical agent to cells or tissue which express NTPPH-2. 

In another embodiment, a vector expressing the complement of the polynucleotide 
encoding NTPPH-2 may be administered to a subject to treat or prevent a cancer 

10 including, but not limited to, those described above. 

In other embodiments/any of the proteins, antagonists, antibodies, agonists, 
complementary sequences, or vectors of the invention may be administered in combination 
with other appropriate therapeutic agents. Selection of the appropriate agents for use in 
combination therapy may be made by one of ordinaiy skill in the art, according to 

15 conventional pharmaceutical principles. The combination of therapeutic agents may act 
synergistically to effect the treatment or prevention of the various disorders described 
above. Using this approach, one may be able to achieve therapeutic efficacy with lower 
dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of NTPPH-2 may be produced using methods which are generally 

20 known in the art In particular, purified NTPPH-2 may be used to produce antibodies or to 
screen libraries of pharmaceutical agents to identify those which specifically bind NTPPH- 
2. Antibodies to NTPPH-2 may also be generated using methods that are well known in 
the art Such antibodies may include, but are not limited to, polyclonal, monoclonal, 
chimeric, and single chain antibodies, Fab fragments, and fragments produced by a Fab 

25 expression library. Neutralizing antibodies (i.e., those which inhibit dimer formation) are 
especially preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, 
humans, and others may be immunized by injection with NTPPH-2 or with any fragment 
or oligopeptide thereof which has immunogenic properties. Depending on the host 

30 species, various adjuvants may be used to increase immunological response. Such 
adjuvants include, but are not limited to, Freund's, mineral gels such as aluminum 
hydroxide, and surface active substances such as lysolecithin, pluronic polyols, 
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polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in 
humans, BCG (bacilli Calmette-Guerin) and Corvnebacterium parvum are especially 
preferable. 

For the production of antibodies, binding proteins, or peptides which bind 
5 specifically to NTPPH-2, libraries of single chain antibodies, Fab fragments, 
other antibody fragments, non-antibody protein domains, or peptides may be 
screened. The libraries could be generated using phage display, other recombinant DNA 
methods, or peptide synthesis (Vaughan, T. J.et al.(l 996) Nature Biotechnology 
14:309-314). The libraries would be screened using methods which are well known in the 

10 art to identify sequences which demonstrate specific binding to NTPPH-2. 

It is preferred that the oligopeptides, peptides, or fragments used to induce 
antibodies to NTPPH-2 have an amino acid sequence consisting of at least about 5 amino 
acids, and, mpre preferably, of at least about 10 amino acids. It is also preferable that 
these oligopeptides, peptides, or fragments are identical to a portion of the amino acid 

15 sequence of the natural protein and contain the entire amino acid sequence of a small, 
naturally occurring molecule. Short stretches of NTPPH-2 amino acids may be fused with 
those of another protein, such as KLH, and antibodies to the chimeric molecule may be 
produced. 

Monoclonal antibodies to NTPPH-2 may be prepared using any technique which 
20 provides for the production of antibody molecules by continuous cell lines in culture. 
These include, but are not limited to, the hybridoma technique, the human B-cell 
hybridoma technique, and the EBV-hybridoma technique. (Kohler, G. et al. (1 975) Nature 
256:495-497; Kozbor, D et al. (1985) J. Immunol. Methods 81:31-42; Cote, R.J. et al. 
(1983) Proc. Natl. Acad. Sci. 80:2026-2030; and Cole, S.P. et al. (1984) Mol. Cell Biol. 
25 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such 
as the splicing of mouse antibody genes to human antibody genes to obtain a molecule 
with appropriate antigen specificity and biological activity, can be used. (Morrison, S.L. 
et al. (1984) Proc. Natl. Acad. Sci. 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 
30 312:604-608; and Takeda, S. et al. (1985) Nature 314:452-454.) Alternatively, techniques 
described for the production of single chain antibodies may be adapted, using methods 
known in the art, to produce NTPPH-2-specific single chain antibodies. Antibodies with 
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related specificity,, but of distinct idiotypic composition, may be generated by chain 
shuffling from random combinatorial immunoglobulin libraries. (Burton D.R. (1 991) 
Proc. Natl. Acad. Sci. 88:11 120-1 1 123.) 

Antibodies may also be produced by inducing in vivo production in the 
5 lymphocyte population or by screening immunoglobulin libraries or panels of highly 
specific binding reagents as disclosed in the literature. (Orlandi, R. etal. (1989) Proc. 
Nad. Acad. Sci. 86: 3833-3837, and Winter, G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for NTPPH-2 may also be 
generated. For example, such fragments include, but are not limited to, F(ab')2 fragments 

1 0 produced by pepsin digestion of the antibody molecule and Fab fragments generated by 
reducing the disulfide bridges of the F(ab')2 fragments. Alternatively, Fab expression 
libraries may be constructed to allow rapid and easy identification of monoclonal Fab 
fragments with the desired specificity. (Huse, W.D. et al. (1989) Science 254:1275-1281.) 
Various immunoassays may be used to identify antibodies having the desired 

15 specificity. Numerous protocols for competitive binding or immunoradiometric assays 
using either polyclonal or monoclonal antibodies with established specificities are well 
known in the art. . Such immunoassays typically involve the measurement of complex 
formation between NTPPH-2 and its specific antibody. A two-site, monoclonal-based 
immunoassay utilizing monoclonal antibodies reactive to two non-interfering NTPPH-2 

20 epitopes is preferred, but a competitive binding assay may also be employed. (Maddox, 
supra .) 

In another embodiment of the invention, the polynucleotides encoding NTPPH-2, 
or any fragment or complement thereof, may be used for therapeutic purposes. In one 
aspect, the complement of the polynucleotide encoding NTPPH-2 may be used in 

25 situations in which it would be desirable to block the transcription of the mRNA. In 
particular, cells may be transformed with sequences complementary to polynucleotides 
encoding NTPPH-2. Thus, complementary molecules or fragments may be used to 
modulate NTPPH-2 activity, or to achieve regulation of gene function. Such technology is 
now well known in the art, and sense or antisense oligonucleotides or larger fragments can 

30 be designed from various locations along the coding or control regions of sequences 
encoding NTPPH-2. 

Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia 
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viruses, or from various bacterial plasmids, may be used for delivery of nucleotide 
sequences to the targeted organ, tissue, or cell population. Methods which are well known 
to those skilled in the art can be used to construct vectors which will express nucleic acid 
sequence complementary to the polynucleotides of the gene encoding NTPPH-2. These 

5 techniques are described, e.g., in Sambrook (supra) and in Ausubel (supra) . 

Genes encoding NTPPH-2 can be turned off by transforming a cell or tissue with 
expression vectors which express high levels of a polynucleotide or fragment thereof 
encoding NTPPH-2. Such constructs may be used to introduce untranslatable sensie or 
antisense sequences into a cell. Even in the absence of integration into the DNA, such 

10 vectors may continue to transcribe RNA molecules until they are disabled by endogenous 
nucleases. Transient expression may last for a month or more with a non-replicating 
vector, and may last even longer if appropriate replication elements are part of the vector 
system. 

As mentioned above, modifications of gene expression can be obtained by 

15 designing complementary sequences or antisense molecules (DNA, RNA, or PNA) to the 
control, 5', or regulatory regions of the gene encoding NTPPH-2. Oligonucleotides 
derived from the transcription initiation site, e.g., between about positions -10 and +10 
from the start site, are preferred. Similarly, inhibition can be achieved using triple helix 
base-pairing methodology. Triple helix pairing is useful because it causes inhibition of the 

20 ability of the double helix to open sufficiently for the binding of polymerases, transcription 
factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have 
been described in the literature. (Gee, J.E. et al. (1994) in Huber, B.E. and B.I. Carr, 
Molecular and Immunologic Approaches. Futura Publishing Co., Mt Kisco, NY.) A 
complementary sequence or antisense molecule may also be designed to block translation 

25 of mRNA by preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific 
cleavage of RNA. The mechanism of ribozyme action involves sequence-specific 
hybridization of the ribozyme molecule to complementary target RNA, followed by 
endonucleolytic cleavage. For example, engineered hammerhead motif ribozyme 

30 molecules specifically and efficiently catalyze endonucleolytic cleavage of sequences 
encoding NTPPH-2. 

Specific ribozyme cleavage sites within any potential RNA target are initially 
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identified by scanning the target molecule for ribozyme cleavage sites, including the 
following sequences: GUA, GUU, and GUC. Once identified, short RNA sequences of 
between 1 5 and 20 ribonucleotides corresponding to the region of the target gene 
containing the cleavage site may be evaluated for secondary structural features which may 
5 render the oligonucleotide inoperable. The suitability of candidate targets may also be 
evaluated by testing accessibility to hybridization with complementary oligonucleotides 
using ribonuclease protection assays. 

. Complementary ribonucleic acid molecules and ribozymes of the invention may be 
prepared by any method known in the art for the synthesis of nucleic acid molecules. 
10 These include techniques for chemically synthesizing oligonucleotides such as solid phase 
phosphoramidite chemical synthesis. Alternatively, RNA molecules may be generated by 
in JOfiS and in yjyji transcription of DNA sequences encoding NTPPH-2. Such DNA 
sequences may be incorporated into a wide variety of vectors with suitable RNA 
polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that 
15 synthesize complementary RNA constitutively or inducibly can be introduced into cell 
lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. 
Possible modifications include, but are not limited to, the addition of flanking sequences at 
the 5' and/or 3' ends of the molecule or the use of phosphorothioate or 2' O-methyl rather 
20 than phosphodiesterase linkages within the backbone of the molecule. This concept is 
inherent in the production of PNAs and can be extended in all of these molecules by the 
inclusion of nontraditional bases such as inosine, queosine, and wybutosine, as well as 
acetyl-, methyl, thio-, and similarly modified forms of adenine, cytidine, guanine, 
thymine, and uridine which are not as easily recognized by endogenous endonucleases. 
25 Many methods for introducing vectors into cells or tissues are available and 

equally suitable for use in yiy^, in yifia, and ex vivo . For ex vivo therapy, vectors may be 
introduced into stem cells taken from the patient and clonally propagated for autologous 
transplant back into that same patient. Delivery by transfection, by liposome injections, or 
by polycationic amino polymers may be achieved using methods which are well known in 
30 the art, such as those described in Goldman, C.K. et al. (19*97; Nature Biotechnology 
15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in 
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need of such therapy, including, e.g., mammals such as dogs, cats, cows, horses, rabbits, 
monkeys, and most preferably, humans. 

An additional embodiment of the invention relates to the administration of a 
pharmaceutical composition, in conjunction with a pharmaceutical^ acceptable carrier, for 
5 any of the therapeutic effects discussed above. Such pharmaceutical compositions may 
consist of NTPPH-2, antibodies to NTPPH-2, and mimetics, agonists, antagonists, or 
inhibitors of NTPPH-2. The compositions may be administered alone or in combination 
with at least one other agent, such as a stabilizing compound, which may be administered 
in any sterile, biocompatible pharmaceutical carrier including, but not limited to, saline, 

10 buffered saline, dextrose, and water. The compositions may be administered to a patient 
alone, or in combination with other agents, drugs or hormones. 

The pharmaceutical compositions utilized in this invention may be administered by 
any number of routes including, but not limited to, oral, intravenous, intramuscular, 
intra-arterial, intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, 

15 intraperitoneal, intranasal, enteral, topical, sublingual, or rectal means. 

In addition to the active ingredients, these pharmaceutical compositions may 
contain suitable pharmaceutically-acceptable carriers comprising excipients and auxiliaries 
which facilitate processing of the active compounds into preparations which can be used 
pharmaceutical^. Further details on techniques for formulation and administration may 

20 be found in the latest edition of Remington's Pharmaceu tical Sciences . (Maack Publishing 
Co., Easton, PA.) 

Pharmaceutical compositions for oral administration can be formulated using 
pharmaceutical^ acceptable carriers well known in the art in dosages suitable for oral 
administration. Such carriers enable the pharmaceutical compositions to be formulated as 

25 tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for 
ingestion by the patient. 

Pharmaceutical preparations for oral use can be obtained through combining active 
compounds with solid excipient and processing the resultant mixture of granules 
(optionally, after grinding) to obtain tablets or dragee cores. Suitable auxiliaries can be 

30 added, if desired. Suitable excipients include carbohydrate or protein fillers, such as 
sugars, including lactose, sucrose, mannitol, and sorbitol; starch from corn, wheat, rice, 
potato, or other plants; cellulose, such as methyl cellulose, 
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hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums, including 
arabic and tragacanth; and proteins, such as gelatin and collagen. If desired, disintegrating 
or solubilizing agents may be added, such as the cross-linked polyvinyl pyrrolidone, agar, 
and alginic acid or a salt thereof, such as sodium alginate. 
5 Dragee cores may be used in conjunction with suitable coatings, such as 

concentrated sugar solutions, which may also contain gum arabic, talc, 
polyvinylpyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer 
solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may 
be added to the tablets or dragee coatings for product identification or to characterize the 
10 quantity of active compound, i.e., dosage. 
""■ ; Pharmaceutical preparations which can be used orally include push-fit capsules 

made of gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as 
glycerol or sorbitol. Push-fit capsules can contain active ingredients mixed with fillers or 
binders, such as lactose or starches, lubricants, such as talc or magnesium stearate, and, 
15 optionally, stabilizers. In soft capsules, the active compounds may be dissolved or 

suspended in suitable liquids, such as fatty oils, liquid, or liquid polyethylene glycol with 
or without stabilizers. 

Pharmaceutic^ fomudauoiis sri . 
formulated in aqueous solutions, preferably in physiologically compatible buffers such as 
20 Hanks's solution, Ringer's solution, or physiologically buffered saline. Aqueous injection 
suspensions may contain substances which increase the viscosity of the suspension, such 
as sodium carboxymethyl cellulose, sorbitol, or dextran. Additionally, suspensions of the 
active compounds may be prepared as appropriate oily injection suspensions. Suitable 
lipophilic solvents or vehicles include fatty oils, such as sesame oil, or synthetic fatty acid 
25 esters, such as ethyl oleate, triglycerides, or liposomes. Non-lipid polycationic amino 
polymers may also be used for delivery. Optionally, the suspension may also contain 
suitable stabilizers or agents to increase the solubility of the compounds arid allow for the 
preparation of highly concentrated solutions. 

For topical or nasal administration, penetrants appropriate to the particular barrier 
30 to be permeated are used in the formulation. Such penetrants are generally known in the 
art. 

The pharmaceutical compositions of the present invention may be manufactured in 
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a manner that is known in the art, e.g., by means of conventional mixing, dissolving, 
granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping, or 
lyophilizing processes. 

The pharmaceutical composition may be provided as a salt and can be formed with 
5 many acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, 
malic, and succinic acid. Salts tend to be more soluble in aqueous or other protonic 
solvents than are the corresponding free base forms. In other cases, the preferred 
preparation may be a lyophilized powder which may contain any or all of the following: 1 
mM to. 50 mM histidine, 0.1% to 2% sucrose, and 2% to 1% mannitol, at a pH range of 4.5 
10 to 5.5, that is combined with buffer prior to use. 

After pharmaceutical compositions have been prepared, they can be placed in an 
appropriate container and labeled for treatment of an indicated condition. For 
administration of NTPPH-2, such labeling would include amount, frequency, and method 
of administration. 

15 Pharmaceutical compositions suitable for use in the invention include compositions 

wherein the active ingredients are contained in an effective amount to achieve the intended 
purpose. The determination of an effective dose is well within the capability of those , 
skilled in the art. 

For any compound, the therapeutically effective dose can be estimated initially 
20 either in cell culture assays of neoplastic cells, e.g., or in animal models, usually mice, 
rabbits, dogs, or pigs. An animal model may also be used to determine the appropriate 
concentration range and route of administration. Such information can then be used to 
determine useful doses and routes for administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, e.g. 
25 NTPPH-2 or fragments thereof, antibodies of NTPPH-2, and agonists, antagonists or 
inhibitors of NTPPH-2, which ameliorates the symptoms or condition. Therapeutic 
efficacy and toxicity may be determined by standard pharmaceutical procedures in cell 
cultures or with experimental animals, such as by calculating the ED50 (the dose 
therapeutically effective in 50% of the population) or LD50 (the dose lethal to 50% of the 
30 population) statistics. The dose ratio between therapeutic and toxic effects is the 
therapeutic index, and it can be expressed as the LD50/ED50 ratio. Pharmaceutical 
. compositions which exhibit large therapeutic indices are preferred. The data obtained 
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from cell culture assays and animal studies is used in formulating a range of dosage for 
human use. The dosage contained in such compositions is preferably within a range of 
circulating concentrations that include the ED50 with litde or no toxicity. The dosage 
varies within this range depending upon the dosage form employed, the sensitivity of the 
5 patient, and me route of administration. 

The exact dosage will be determined by the practitioner, in light of factors related 
to the subject requiring treatment. Dosage and administration are adjusted to provide 
sufficient levels of the active moiety or to maintain the desired effect. Factors which may 
be taken into account include the severity of the disease state, the general health of the 
10 subject, the age, weight, and gender of the subject, diet, time and frequency of 

administration, drug combinauon(s), reaction sensitivities, and tolerance/response to 
therapy. Long-acting pharmaceutical compositions may be administered every 3 to 4 
days, every week, or once every two weeks depending on the half-life and clearance rate of 
the particular formulation. 
15 Normal dosage amounts may vary from 0.1 ^g to 100,000 fig, up to a total dose of 

about 1 gram, depending upon the route of administration. Guidance as to particular 
dosages and methods of delivery is provided in the literature and generally available to 
practitioners in the art. Those skilled in the art will employ different formulations for •" 
nucleotides than for proteins or their inhibitors. Similarly, delivery of polynucleotides or 
20 polypeptides will be specific to particular cells, conditions,, locations, etc. 

DIAGNOSTICS 

In another embodiment, antibodies which specifically bind NTPPH-2 may be used 
for the diagnosis of disorders characterized by expression of NTPPH-2, or in assays to 

25 monitor patients being treated with NTPPH^2 or agonists, antagonists, and inhibitors of 
NTPPH-2. Antibodies useful for diagnostic purposes may be prepared in the same manner 
as those described above for therapeutics. Diagnostic assays for NTPPH-2 include 
methods which utilize the antibody and a label to detect NTPPH-2 in human body fluids or 
in extracts of cells or tissues. The antibodies may be used with or without modification, 

30 and may be labeled by covalent or non-covalent joining with a reporter molecule. A wide 
variety of reporter molecules, several of which are described above, are known in the art 
and may be used. 
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A variety of protocols for measuring NTPPH-2, including ELISAs, RIAs, and 
FACS, are known in the ait and provide a basis for diagnosing altered or abnormal levels 
of NTPPH-2 expression. Normal or standard values for NTPPH-2 expression are 
established by combining body fluids or cell extracts taken from normal mammalian 
5 subjects, preferably human, with antibody to NTPPH-2 under conditions suitable for 
complex formation. The method for detecting NTPPH-2 in a biological sample would 
comprise the steps of: a) providing a biological sample; b) combining the biological 
sample and an anti-NTPPH-2 antibody under conditions which are suitable for complex 
formation to occur between NTPPH-2 and the antibody; and c) detecting complex 

10 formation between NTPPH-2 and the antibody, thereby establishing the presence of 
NTPPH-2 in the biological sample.. The amount of complex formation then may be 
quantified by various methods, preferably by photometric means. Quantities of NTPPH-2 
expressed in subject, control, and disease samples from biopsied tissues are compared with 
the standard values. Deviation between standard and subject values establishes the 

15 parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding NTPPHf-2 
may be used for diagnostic purposes. The polynucleotides which may be used inciude, 
oligonucleotide sequences, complementary KNA and DNA molecules, and PNAs. The 
polynucleotides may be used to detect and quantitate gene expression in biopsied tissues in 

20 which expression of NTPPH-2 may be correlated with disease. The diagnostic assay may 
be used to distinguish between absence, presence, and excess expression of NTPPH-2, and 
to monitor regulation of NTPPH-2 levels during therapeutic intervention. 

In one aspect, hybridization with PGR probes which are capable of detecting 
polynucleotide sequences, including genomic sequences, encoding NTPPH-2 or closely 

25 related molecules may be used to identify nucleic acid sequences which encode NTPPH-2. 
The specificity of the probe, whether it is made from a highly specific region (e.g., the 5' 
regulatory region) or from a less specific region (e.g., the 3' coding region), and the 
stringency of the hybridization or amplification (maximal, high, intermediate, or low), will 
determine whether the probe identifies only naturally occurring sequences encoding 

30 NTPPH-2, alleles, or related sequences. 

Probes may also be used for the detection of related sequences, and should 
preferably contain at least 50% of the nucleotides from any of the NTPPH-2 encoding 
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sequences. The hybridization probes of the subject invention may be DNA or RNA and 
may be derived from the sequence of SEQ ID NO:2 or from genomic sequences including 
promoter and enhancer elements and introns of the naturally occurring NTPPH-2. 

Means for producing specific hybridization probes for DNAs encoding NTPPH-2 
include the cloning of polynucleotide sequences encoding NTPPH-2 or NTPPH-2 
derivatives into vectors for the production of mRNA probes: Such vectors are. known in 
the art, are commercially available, and may be used to synthesize RNA probes in yjlffi by 
means of the addition of the appropriate RNA polymerases and the appropriate labeled 
nucleotides. Hybridization probes may be labeled by a variety of reporter , groups, e.g!, by 
radionuclides such as 32 P or 35 S, or by enzymatic labels, such as alkaline phosphatase 
coupled to the probe via avidih/biotin coupling systems, and the like. • 

Polynucleotide sequences encoding NTPPH-2 may be used for the diagnosis of 
conditions or disorders which are associated with expression of NTPPH-2. Examples of 
such conditions or disorders include, but are not limited to, arthropathies, e.g., Behcet's 
15 syndrome, Charcot osteoarthropathy, CPPD disease, diabetic neuropathic arthropathy, 
degenerative joint disease, fibromyalgias, hemachromatosis, hemophilic arthropathy, 
Jaccoud's type arthropathy, lupus erythematosus, mixed connective tissue disease, 
Muckle-Wells syndrome, osteoarthritis, progressive systemic sclerosis, pseudogout, 
psoriasis, Reiter's syndrome, rheumatoid arthritis, Sjogren's syndrome, and 
20 spondyloarthropathies; immunological disorders, e.g., AIDS, Addison's disease, adult 
respiratory distress syndrome, allergies, anemia, asthma, atherosclerosis, bronchitis, 
cholecystitis, Crohn's disease, ulcerative colitis, atopic dermatitis, dermatomyositis, 
diabetes mellitus, emphysema, erythema nodosum, atrophic gastritis, glomerulonephritis, 
gout, Graves' disease, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, 
25 myasthenia gravis* myocardial or pericardial inflammation, osteoporosis, pancreatitis, 
polymyositis, scleroderma, and autoimmune thyroiditis; complications of cancer, 
hemodialysis, and extracorporeal circulation; viral, bacterial, fungal, parasitic, protozoal, 
and helminthic infections; and trauma; and cancers, e,g., as adenocarcinoma, leukemia, 
lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and particularly cancers of 
30 the adrenal gland, bladder, bone, bone marrow, brain, breast, cartilage; cervix, gall 

bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, 
parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and 
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uterus. 

The polynucleotide sequences encoding NTPPH-2 may be used in SoixtheJn or 
northern analysis, dot blot, or other membrane-based technologies; in PGR technologies; 
in dipstick, pin, and ELISA assays; and in microarrays utilizing fluids or tissues from 
5 patient biopsies to detect altered NTPPH-2 expression. Such qualitative or quantitative 
methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding NTPPH-2 may be useful 
in assays that detect the presence of associated disorders, particularly those mentioned 
above. Hie nucleotide sequences encoding NTPPH-2 may be labeled by standard methods 

1 0 and added to a fluid or tissue sample from a patient under conditions suitable for the 
fomiation of hybridization complexes. After a suitable incubation period, the sample is 
washed and the signal is quantitated and compared with a standard value. If the amount of 
signal in the patient sample is significantly altered from that of a comparable control 
sample, the nucleotide sequences have hybridized with nucleotide sequences in the 

15 sample, and the presence of altered levels of nucleotide sequences encoding NTPPH-2 in 
the sample indicates the presence of the associated disorder. Such assays may also be used 
to evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in 
clinical trials, or in monitoring the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with 

20 expression of NTPPH-2, a normal or standard profile for expression is established. This 
may be accomplished by combining body fluids or cell extracts taken from normal 
subjects, either animal or human, with a sequence, or a fragment thereof, encoding 
NTPPH-2, under conditions suitable for hybridization or amplification. Standard 
hybridization may be quantified by comparing the values obtained from normal subjects 

25 with values from an experiment in which a known amount of a substantially purified 
polynucleotide is used. Standard values obtained from normal samples may be compared 
with values obtained from samples from patients who are symptomatic for a disorder. 
Deviation from standard values is used to establish the presence of a dison|er. 

Once the presence of a disorder is established and a treatment protocol is initiated, 

30 hybridization assays may be repeated on a regular basis to evaluate whether the level of 
expression in the patient begins to approximate that which is observed in the normal 
subject. The results obtained from successive assays may be used to show the efficacy of 
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treatment over a period ranging from several days to months. 

With respect to cancer, the presence of a relatively high amount of transcript in 
biopsied tissue from an individual may indicate a predisposition for the development of 
the disease, or may provide a means for detecting the disease prior to the appearance of 
5 actual clinical symptoms. A more definitive diagnosis of this type may allow health 
professionals to employ preventative measures or aggressive treatment earlier thereby 
preventing the development or further progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences 
encoding NTPPH-2 may involve the use of PCR. Such oligomers may be chemically 

10 synthesized, generated enzymatically, or produced in vitro . Oligomers will preferably 
consist of two nucleotide sequences, one with sense orientation (5* to 3') and another with 
antisense orientation (3* to 5')> employed under optimized conditions for identification of 
a specific gene or condition. The same two oligomers, nested sets of oligomers, or even a 
degenerate pool of oligomers may be employed under less stringent conditions for 

1 5 detection and/or quantitation of closely related DNA or RNA sequences. 

Methods which may also be used to quarititate the expression of NTPPH-2 include 
radiolabeling or biotinylating nucleotides, coamplification of a control nucleic acid, and 
interpolating results from standard curves. (Melby, P.C. et al. (1993) J. Immunol. 
Methods 159:235-244, and Duplaa, C. et al (1993) Anal. Biochem. 229-236.) The speed 

20 of quantitation of multiple samples may be accelerated by running the assay in an ELISA 
format where the oligomer of interest is presented in various dilutions and a 
spectrophotometric or colorimetric response gives rapid quantitation. 

In further embodiments, oligonucleotides or longer fragments derived from any of 
the polynucleotide sequences described herein may be used as targets in a microarray . The 

25 microarray can be used to monitor the expression level of large numbers of genes 

simultaneously (to produce a transcript image) and to identify genetic variants, mutations; 
and polymorphisms. This information may be used in determining gene function, in 
understanding the genetic basis of a disorder, in diagnosing a disorder, and in developing 
and monitoring the activities of therapeutic agents. 

30 In one embodiment, the microarray is prepared and used according to methods 

known in the art, such as those described in published PCT application W095/1 1995 
(Chee et al.), Lockhart, D. J. et al. (1996; Nat. Biotech. 14:1675-1680), and Schena, M. et 
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al. (1996; Proc. Natl. Acad. Sci. 93:10614-10619.) 

The microarray is preferably composed of a large number of unique single- 
stranded nucleic acid sequences, usually either synthetic antisense oligonucleotides or 
fragments of cDNAs, fixed to a solid support. The oligonucleotides are preferably about 6 
5 to 60 nucleotides in length, more preferably about 1 5 to 30 nucleotides in length, and most 
preferably about 20 to 25 nucleotides in length. For a certain type of microarray, it may be 
preferable to use oligonucleotides which are about 7 to 10 nucleotides in length. The 
microarray may contain oligonucleotides which cover the known 5 1 or 3' sequence, or may 
contain sequential oligonucleotides which cover the full length sequence or unique 

10 oligonucleotides selected from particular areas along the length of the sequence. 

Polynucleotides used in the microarray may be oligonucleotides specific to a gene or genes 
of interest in which at least a fragment of the sequence is known or oligonucleotides 
specific to one or more unidentified cDNAs common to a particular cell or tissue type or 
to a normal, developmental, or disease state. In certain situations, it may be appropriate to 

1 5 use pairs of oligonucleotides on a microarray. The pairs will be identical, except for one 
nucleotide preferably located in the center of the sequence. The second oligonucleotide in 
the pair (mismatched by one) serves as a control. The number of oligonucleotide pairs 
may range from about 2 to 1 ,000,000. 

In order to produce oligonucleotides to a known sequence for a microarray, the 

20 gene of interest is examined using a computer algorithm which starts at the 5' end, or, 
more preferably, at the 3 f end of the nucleotide sequence. The algorithm identifies 
oligomers of defined length that are unique to the gene, have a GC content within a range 
suitable for hybridization, and lack predicted secondary structure that may interfere with 
hybridization. In one aspect, the oligomers are synthesized at designated areas on a 

25 substrate using a light-directed chemical process. The substrate may be paper, nylon, any 
other type of membrane, filter, chip, glass slide, or any other suitable solid support. 

In one aspect, the oligonucleotides may be synthesized on the surface of the 
substrate by using a chemical coupling procedure and an ink jet application apparatus, 
such as that described in published PCT application W095/251 1 16 (Baldeschweiler et al.). 

30 In another aspect, a grid array analogous to a dot or slot blot (HYBRIDOT® apparatus, 
GlBCO/BRL) may be used to arrange and link cDNA fragments or oligonucleotides to the 
surface of a substrate using a vacuum system or thermal, UV, mechanical or chemical 
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bonding procedures. In yet another aspect, an array may be produced by hand.or by using 
available devices, materials, and machines (including Brinkmann® multichannel pipettors 
or robotic instruments), and may contain 8, 24, 96, 384, 1536, or 6144 oligonucleotides, or 
any other multiple from 2 to 1 ,000,000 which lends itself to the efficient use of 
5 commercially available instrumentation. 

In order to conduct sample analysis using the microarrays, polynucleotides are 1 
extracted from a biological sample. The biological samples may be obtained from any 
bodily fluid (blood, urine, saliva, phlegm, gastric juices, etc.), cultured cells, biopsies, or 
other tissue preparations. To produce probes, the polynucleotides extracted from the 
10 sample are used to produce nucleic acid sequences which are complementary to the . 
nucleic acids on microaiT ay. If the microarray consists of cDNAs, antisense RNAs 
(aRNA) are appropriate probes. Therefore, in one aspect, mRNA is used to produce 
cDNA which, in turn and in the presence of fluorescent nucleotides, is used to produce 
fragment or oligonucleotide aRNA probes. These fluorescently labeled probes are 
1 5 incubated with the microarray so that the probe sequences hybridize to the cDNA ■ 
oligonucleotides of the microarray. In another aspect, nucleic acid sequences used as 
probes can include polynucleotides, fragments, and complementary or antisehse sequences 
produced using restriction enzymes, PCR technologies, and Oligolabeling or TransProbe 
kits (Pharmacia & Upjohn) well known in the area of hybridization technology. 
20 Incubation conditions are adjusted so that hybridization occurs with precise 

complementary matches or with various degrees of less complementarity. After removal 
of nonhybridized probes, a scanner is used to determine the levels and patterns of 
fluorescence. The scanned images are exarnined to determine the degree of 
complementarity and the relative abundance of each oligonucleotide sequence on the 
25 microarray. A detection system may be used to measure the absence, presence, and 

amount of hybridization for all of the distinct sequences simultaneously. This data may be 
used for large scale correlation studies or for functional analysis of the sequences, 
mutations, variants, or polymorphisms among samples. (Heller, R.A. et al. (1997) Proc. 
Natl. Acad. Sci. 94:2150-2155.) 
30 In another embodiment of the invention, nucleic acid sequences encoding NTPPH- 

2 may be used to generate hybridization probes useful for mapping the naturally occurring 
genomic sequence. The sequences may be mapped to a particular chromosome, to a 

-43- 



<WO 993261 OA 1_l_> 



WO 99/32610 PCTAJS98/25558 

specific region of a chromosome, or to artificial chromosome constructions, such as 
human artificial chromosomes (HACs), yeast artificial chromosomes (YACs), badterial 
artificial chromosomes (BACs), bacterial PI constructions, or single chromosome cDNA 
libraries, such as those reviewed in Price, CM. (1993; Blood Rev. 7:127-134) and Trask, 
5 B.J. (1991; Trends Genet 7:149-154.) 

Fluorescent in situ hybridization (FISH, as described in Verma et al. (1 988) Human 
Chromosomes: A Manual of Basic Techniques- Pergamon Press, New York, NY) may be 
correlated with other physical chromosome mapping techniques and genetic map data. 
Examples of genetic map data can be found in various scientific journals or at the Online 

10 Mendelian Inheritance in Man (OMIM) site. Correlation between the location of the gene 
encoding NTPPH-2 on a physical chromosomal map and a specific disorder, or 
predisposition to a specific disorder, may help define the region of DNA associated with 
that disorder. The nucleotide sequences of the subject invention may be used to detect 
differences in gene sequences between normal, carrier, and affected individuals. 

15 In situ hybridization of chromosomal preparations and physical mapping 

techniques, such as linkage analysis using established chromosomal markers, may be used 
for extending genetic maps. Often the placement of a gene on the chromosome of another 
mammalian species, such as mouse, may reveal associated markers even if the number or 
arm of a particular human chromosome is not known. New sequences can be assigned to 

20 chromosomal arms, or parts thereof, by physical mapping. This provides valuable 

information to investigators searching for disease genes using positional cloning or other 
gene discovery techniques. Once the disease or syndrome has been crudely localized by 
genetic linkage to a particular genomic region, e.g., AT to 1 1 q22-23 (Gatti, R. A. et al. 
(1988) Nature 336:577-580), any sequences mapping to that area may represent associated 

25 or regulatory genes for further investigation. The nucleotide sequence of the subject 
invention may also be used to detect differences in the chromosomal location due to 
translocation, inversion, etc., among normal, carrier, or affected individuals. 

In another embodiment of the invention, NTPPH-2, its catalytic or immunogenic 
fragments, or oligopeptides thereof can be used for screening libraries of compounds such 

30 as agonists or antagonists in any of a variety of drug screening techniques. The fragment 
employed in such screening may be free in solution, affixed to a solid support, borne on a 
cell surface, or located intracellularly. The formation of binding complexes between 
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NTPPH-2 and the agent being tested may be measured. 

The method provided for screening a library of small molecules to identify a 
molecule which binds NTPPH-2 comprises: a) providing a library of small molecules; b) 
combining the library of small molecules with the polypeptide of SEQ ID NO:2 or a 
5 fragment of SEQ ID NO:2 under conditions which are suitable for complex formation; and 
c) detecting complex formation wherein the presence of the complex identifies a small 
molecule which binds NTPPH-2. The method for identifying one of these small molecules 
which binds NTPPH-2 as an agonist comprises delivering a small molecule of claim 25 
and gamma labeled ATP into cells transformed with a vector expressing NTPPH-2, 

10 growing the cells under suitable conditions, and assaying for Ppi. An increased amount of 
PPi establishes that the small molecule is an agonist which increases NTPPH-2 activity. 
The method for identifying an antagonist comprises delivering a small molecule and 
gamma labeled ATP into cells transformed with a vector expressing NTPPH-2, growing 
the cells under suitable conditions, and assaying the media for Ppi. A reduced amount of 

15 PPi establishes that the small molecule is an antagonist which reduces NTPPH-2 activity. 
Another technique for drug screening which may be used provides for high 
throughput screening of compounds having suitable binding affinity to the protein of 
interest as described in published PCT application WO84/03564. In this method, large 
numbers of different small test compounds are synthesized on a solid substrate, such as 

20 plastic pins or some other surface. The test compounds are reacted with NTPPH-2, or 
fragments thereof, and washed. Bound NTPPH-2 is then detected by methods well known 
in the art. Purified NTPPH-2 can also be coated directly onto plates for use in the 
aforementioned drug screening techniques. Alternatively, non-neutralizing antibodies can 
be used to capture the peptide and immobilize it on a solid support. 

25 In another embodiment, one may use competitive drug screening assays in 

neutralizing antibodies capable of binding NTPPH-2 specifically compete with a test 
compound for binding NTPPH-2. In this manner, antibodies can be used to detect the 
presence of any peptide which shares one or more antigenic determinants with NTPPH-2. 
In additional embodiments, the nucleotide sequences which encode NTPPH-2 may 

30 be used in any molecular biology techniques that have yet to be developed, provided the 
new techniques rely on properties of nucleotide sequences that are currently known, 
including, but not limited to, such properties as the triplet genetic code and specific base 
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pair interactions. 

The examples below are provided to illustrate the subject invention and ari not 
included for the purpose of limiting the invention. 

5 EXAMPLES 
I SATPF 1008 cDNA Library Construction 

The SATPF 1008 cDNA library was made from cartilage tissue obtained from four 
donors with end stage osteoarthritis. The osteoarthritic patients had received steroids such 
as prednisone and a variety of non-steroidal anti-inflammatory drugs. There was no 

10 stimulation with IL-1 (Pfizer Inc., Groton, CT). 

The osteoarthritic cartilage tissue was harvested at joint replacement surgery and 
placed in Dulbecco's Modified Eagle Medium (D-MEM; Gibco/BRL) supplemented with 
antibiotics (penicillin, streptomycin, and gentamicin) and transported to Pfizer 
laboratories. The cartilage was removed aseptically from the underlying bone, rinsed in 

15 D-MEM and diced into small pieces (^4 mm 2 ), and placed in 100 mm petri dishes 
containing 20 ml of Neumari and Tytell's serum free medium (Gibco/BRL). Using the 
protocol of Mitchell et al. (1996) J. Clin. Invest. 97:761-768, the cartilage from each 
patient was digested with 4 mg/ml pronase (Sigma, St Louis, MO) for 1 .5 hours, then 
subsequently digested with 3 mg/ml bacterial collagenase (Sigma) for 1 .5 hours. The 

20 digested cartilage was filtered through a cell strainer to remove undigested material, and 
the cells were pelleted by centrifugation. The cell pellet was washed once with phosphate 
buffered saline (PBS) and then dissolved in 5 ml of buffer consisting of 5 M guanidine 
isothiocyanate, 10 mM 1SDTA, 50 mM Tris (pH 7.5) and 8% B mercaptoethanol. A five- 
fold volume of 4M LiCl was added to the buffer, and the mixture was stored in the 

25 refrigerator overnight. After centrifugation, the precipitate was washed once with 3 M 
LiCl and recentrifuged. The second precipitate was dissolved in a solution consisting of 
0.1% sodium dodecyl sulfate, 1 mM EDTA and 10 mM Tris (pH 7.5). The suspension 
"was "frozen att -7CF C and then vdrtexed during thawing (Cathala et al (1983"; DNA 2:329- 
335). 

30 Total RNA was extracted twice with phenol chloroform, once with chloroform, and 

then, precipitated with ethanol. Equal amounts of RNA from the four donors were 
combined, ethanol precipitated, resulting in 1 12 pooled RNA. Following 
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centrifiigation, the RNA pellet was redissolved in DEPC-treated, distilled, deionized water 
(DEPC-ddH 2 0) and run over a CsCl gradient. The RNA was extracted with acid phenol 
(IX at pH 4.0, catalog #972Z, Ambion, Austin, TX), precipitated with ethanol and 
resuspended in DEPC-ddH 2 0. The RNA was treated with RNAse-free DNAse (Epicentre 
5 Technologies, Madison, WI) for 15 minutes, extracted with chloroform, precipitated and 
washed with ethanol, and dissolved in DEPC-ddH 2 0. 

The mRNA was handled according to the recommended protocols in the 
Superscript Plasmid System for cDNA synthesis and plasmid cloning (Cat. #18248-013, 
Gibco/BRL). The cDNAs were fractionated on a Sepharose CL4B column (Cat. 
10 #275 1 05-0 1 ; Pharmacia), and those cDNAs exceeding 4Q0 bp were ligated into pINC Y 1 
(Incyte). The plasmid pINCY 1 was subsequently transformed into DH5«™ competent 
cells (Cat #18258-012; Gibco/BRL), 
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El Isolation and Sequencing of cDNA Clones 

Plasmid DNA was released from the cells and purified using the REAL Prep 96 
plasmid kit (Catalog #261 73, QIAGEN, Inc., Chatsworth, CA). The recommended 
protocol was employed except for the following changes: 1 ) the bacteria were cultured in 1 
5 ml of sterile Terrific Broth (Catalog #2271 1, GIBCO/BRL ) with carbenicillin at 25 mg/L 
and glycerol at 0.4%; 2) after inoculation, the cultures were incubated for 19 hours and at 
the end of incubation, the cells were lysed with 0.3 ml of lysis buffer, and 3) following 
isopropanol precipitation, the plasmid DNA pellet was resuspended in 0. 1 ml of distilled 
water. After the last step in the protocol, samples were transferred to a 96-well block for 
10 storage at 4° C. 

The cDNAs were sequenced by the method of Sanger, et al. (1 975, J, Mol. Biol. 
94:441f), using a Hamilton Micro Lab 2200 (Hamilton, Reno, NV) in combination with 
Peltier Thermal Cyclers (PTC200 from MJ Research, Watertown, MA) and Applied 
Biosystems 377 DNA Sequencing Systems; and the reading frame was determined. 

15 

III Homology Searching of cDNA Clones and Their Deduced Proteins 

The nucleotide sequences and/or amino acid sequences of the Sequence Listing , 
were used to query sequences in the GenBank, SwissProt, BLOCKS, and Pima II 
databases. These databases, which contain previously identified and annotated sequences, 

20 were searched for regions of homology using BLAST (Basic Local Alignment Search 
Tool). (Altschul, S.F. (1993) J. Mol. Evol 36:290-300; and Altschul et al. (1990) J. Mol. 
Biol. 215:403-410.) 

BLAST produced alignments of both nucleotide and amino acid sequences to 
determine sequence similarity. Because of the local nature of the alignments, BLAST was 

25 especially useful in determining exact matches or in identifying homologs which may be 
ofprokaryotic (bacterial) or eukaryotic (animal, fungal, or plant) origin. Other algorithms 
such as the one described in Smith, T. et al. (1992; Protein Engineering 5:35-51), could 
have been used when dealing with primary sequence patterns and secondary structure gap 
penalties. The sequences disclosed in this application have lengths of at least 49 

30 nucleotides and have no more than 1 2% uncalled bases (where N is recorded rather than A, 
C, G, orT). 

. The BLAST approach searched for matches between a query sequence and a 
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database sequence. BLAST evaluated the statistical significance of any matches found, 
and reported only those matches that satisfy the user-selected threshold of significance. In 
this application, threshold was set at 1 0 23 for nucleotides and i0 :, ° for peptides; 

Incyte nucleotide sequences were searched against the GenBank databases for 
5 primate (pri), rodent (rod), and other mammalian sequences (mam), and deduced amino 
acid sequences from the same clones were then searched against GenBank functional 
protein databases, mammalian (mamp), vertebrate (vrtp), and eukaryote (eukp), for 
homology. 

10 IV Cloning of Full Length NTPPH-2 

A 2.3 kb cDNA sequence which encodes a partial porcine NTPPH (L.M. Ryan, 
Medical College of Wisconsin, Milwaukee, WI) was searched against NCBI public EST 
and in-house databases. Homologous sequences were found in the osteoarthritic cartilage 
libraries. 

15 A 4.1 kb sequence was identified in using the chondrocyte library and the cDNA 

insert from Incyte Clone 1423393 as a hybridization probe. When a 700 bp restriction 
fragment from the 5" most coding region of the 4. 1 kb clone was used to rescreen the 
osteoarthritic cartilage library, the full length cDNA encoding NTPPH-2 was identified. 
The full length cDNA was sequenced and found to contain an appropriate Kozak initiation 

20 and signal sequence. The cloned polynucleotide sequence was deposited with The 
American Type Culture Collection as Accession No. on (date). 

V Northern Analysis 

Human multiple tissue northern blots were obtained from Clontech (Palo Alto, 

25 CA). Human cartiiage was treated with +/- human recombinant IL-1«. ForRNA 

preparation, chondrocytes were isolated via sequential digestion for 1 .5 hours with pronase 
(4 mg/ml) followed by bacterial collagenase (3 mg/ml) for 3-5 hours. The chondrocytes 
were pelleted, and the cells lysed in guanidinium isothiocyanate. RNA was precipitated 
with LiCl as described in Mitchell et al. (supm.) Northern blot analysis was carried out 

30 using DNA probes labeled with a random primer kit (Pharmacia Biotech Inc. Piscataway, 
NJ). The blots were hybridized overnight at 42C essentially as described in Sambrook et 
al. (supm); then washed three times in 3xSSC/0.1%SDS at room temperature and once in 
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0.3x SSC/0.1% SDS at 60C for 15 min. 

Membrane-based northern analyses of human, dog and rabbit joint tissue RNA 
samples demonstrated the highest levels of NTPPH-2 mKNA expression in cartilage and 
lower, but significant, expression levels in testes, trachea, and bone marrow. 
5 Computer techniques analogous to northern analysis were also performed using 

BLAST. (Altschul (1993) sup©, Altschul (1990) §upm.) The basis of the search is the 
product score which is defined as: 

% sequence identity x % maximum BLAST score 
100 

10 The product score takes into account both the degree of similarity between two sequences 
and the length of the sequence match. For example, with a product score of 40, the match 
will be exact within a 1-2% error, and at 70^ the match will be exact. Homologous 
molecules are usually identified by selecting those which show product scores between 1 5 
and 40, although lower scores may identify related molecules. 

15 Electronic northern analysis shows the expression of this sequence in varibus 

libraries at least 57% of which involve immunological response and at least 26% of which 
are immortalized or cancerous. Of particular note is the expression of NTPPH-2 in 
rheumatoid and bsteoarthritic synovial, chondrocyte, and tibial libraries. 

20 VI Extension of NTPPH-2 Encoding Polynucleotides 

The nucleic acid sequence encoding NTPPH-2 was used to design oligonucleotide 
primers for obtaining 5' regulatory sequences using an appropriate genomic library. One 
primer was synthesized to initiate extension in the antisense direction, and the other was 
synthesized to extend sequence in the sense direction; Primers were used to facilitate the 

25 extension of the known sequence "butwiard" generating amplicons containing new, 

unknown nucleotide sequence for the region of interest. The initial primers were designed 
from the cDNA using OLIGO 4.06 (National Biosciences), or another appropriate 
program, to be about 22 to about 30 nucleotides in length, to have a GC content of 50% or 
more, and to anneal to the target sequence at temperatures of about 68 °to about 72° C. 

30 Any stretch of nucleotides which would result in hairpin structures and primer-primer 
dimerizations was avoided. 

Selected human cDNA libraries (GlBCO/BRL) were used to extend the sequence 
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If more than one extension is necessary or desired, additional sets of primers are designed 
to further extend the known region. j 

High fidelity amplification was obtained by following the instructions for the XL- 
PCR kit (Perkin Elmer) and thoroughly mixing the enzyme and reaction mix. Beginning 
5 with 40 pmol of each primer and the recommended concentrations of all other components 
of the kit, PCR was performed using the Peltier Thermal Cycler (PTC200; M.J. Research, 
Watertown, MA) and the following parameters: 

Step 1 94° C for 1 min (initial denaturation) 

Step 2 65° C fori min 

10 Step 3 68° C for 6 min 

Step4 94°Cforl5sec 

— : Step-S 65° C fori min : '. 

Step 6 68° C for 7 min 

Step 7 Repeat step 4-6 for 15 additional cycles 

15 Step 8 94° C for 15 sec 

Step 9 65° C fori min 

Step 10 68° C for 7:15 min 

. Step 11 Repeat step 8-10 for 12 cycles 

Step 12 72° C for 8 min 

20 Step 13 4° C (and holding) 

A 5-10 /^l aliquot of the reaction mixture was analyzed by. electrophoresis on a 

low concentration (about 0.6-0.8%) agarose mini-gel to determine which reactions were 

successful in extending the sequence. Bands thought to contain the largest products were 

excised from the gel, purified using QIAQuick™ (QIAGEN), and trimmed of overhangs 

25 using Klenow enzyme to facilitate religation and cloning. 

After ethanol precipitation, the products were redissolved in 13 fA of ligation 
buffer, T4-DNA ligase (1 5 units) and 1^1 T4 polynucleotide kinase were added, and 
the mixture was incubated at room temperature for 2-3 hours or overnight at 1 6° C. 
Competent & £fili cells (in 40 fA of appropriate media) were transformed with 3 fA of 

30 ligation mixture and cultured in 80 tA of SOC medium (Sambrook et al., supra .) After 
incubation for one hour at 37° C, the E» edi mixture was plated on Lima Bertani (LB)- 
agar (Sambrook et al., supra') containing 2x Carb. The following day, several colonies 
were randomly picked from each plate and cultured in 150 fA of liquid LB/2x Carb 
medium placed in an individual well of an appropriate, commercially-available, sterile 96- 

35 well microtiter plate. The following day, 5 fA of each overnight culture was transferred 
into a non-sterile 96-well plate and after dilution 1:10 with water, 5 fA of each sample was 
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transferred into a PCR array. 

For PCR amplification, 18 ij\ of concentrated PCR reaction mix (3.3x) containing 
4 units of rTth DNA polymerase, a vector primer, and one or both of the gene specific 
primers used for the extension reaction were added to each well. Amplification was 
5 performed using the following conditions: 



Step 1 94° C for 60 sec 

Step2 94°Cfor20sec 

Step 3 55° C for 30 sec 

Step 4 72° C for 90 sec 

10 Step 5 Repeat steps 2-4 for an additional 29 cycles 

Step 6 72° C for 180 sec 

Step 7 4° C (and holding) 



Aliquots of the PCR reactions were run on agarose gels together with molecular 
weight markers. The sizes of the PCR products were compared to the original partial 
15 cDNAs, and appropriate clones were selected, ligated into plasmid, and sequenced. 

VII Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:2 are employed to screen cDNAs, 
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of , 

20 about 20 base-pairs, is specifically described, essentially the same procedure is used with 
larger nucleotide fragments. Oligonucleotides are designed using state-of-the-art software 
such as OLIGO 4.06 (National Biosciences), labeled by combining 50 pmbl of each 
oligomer and 250 of [y- 32 P] adenosine triphosphate (Amersham) and T4 
polynucleotide kinase (DuPont NEN®, Boston, MA). The labeled oligonucleotides are 

25 substantially purified with Sephadex G-25 superfine resin column (Pharmacia & Upjohn). 
A aliquot containing 1 0 7 counts per minute of the labeled probe is used in a typical 
membrane-based hybridization analysis of human genomic DNA digested with one of the 
following endonucleases (Ase I, Bgl II, Eco RI, Pst I, Xba 1, or Pvu II; DuPont NEN®). 
The DNA from each digest is fractionated on a 0.7 percent agarose gel and 

30 transferred to nylon membranes (Nytran Plus, Schleicher & Schuell, Durham, NH). 
Hybridization is carried out for 16 hours at 40°C. To remove nonspecific signals, blots 
are sequentially washed at room temperature under increasingly stringent conditions up to 
0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. After XOMAT AR™ film 
(Kodak, Rochester, NY) is exposed to the blots in a Phosphoimager cassette (Molecular 
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Dynamics, Sunnyvale, CA) for several hours, hybridization patterns are compared 
visually. I 

VIII Microarrays 

5 To produce oligonucleotides for a micrparray, the nucleotide sequence described 

herein is examined using a computer algorithm which starts at the 3' end of the nucleotide 
sequence. The algorithm identifies oligomers of defined length that are unique to the 
gene, have a GC content within a range suitable for hybridization, and lack predicted 
secondary structure that would interfere with hybridization. The algorithm identifies 20 

10* sequence-specific oligonucleotides of 20 nucleotides in length (20-mers). A matched set 
of oligonucleotides is created in which one nucleotide in the center of each sequence is 
altered. This process is repeated for each gene in the microarray, and double sets of 
twenty 20 mers are synthesized and arranged on the surface of the silicon chip using a 
light-directed chemical process (Chee, M. et al., PCT/W095/1 1 995). 

15 In the alternative, a chemical coupling procedure and an ink jet device are used to 

synthesize oligomers on the surface of a substrate (Baldeschweiler, J.D. et al., 
PCT/W095/25 1 1 6). In another alternative, a "gridded" array analogous to a dot (or slot) 
blot is used to arrange and link cDNA fragments or oligonucleotides to the surface of a 
substrate using a vacuum system, thermal, UV, mechanical or chemical bonding 

20 procedures. An array may be produced by hand or using available materials and machines 
and contain grids of 8 dots, 24 dots, 96 dots, 384 dots, 1 536 dots or 6144 dots. After 
hybridization, the microarray is washed to remove nonhybridized probes, and a scanner is 
used to determine the levels and patterns of fluorescence. The scanned images are 
examined to determine degree of complementarity and the relative abundance of each 

25 oligonucleotide sequence on the micro-array. 

IX Complementary Polynucleotides 

Sequence complementary to the NTPPH-2-encoding sequence, or any part 
thereof, is used to decrease or inhibit expression of naturally occurring NTPPH-2. 
30 Although use of oligonucleotides comprising from about 15 to about 30 base-pairs is 
described, essentially the same procedure is used with smaller or larger sequence 
fragments. Appropriate oligonucleotides are designed using Oligo 4.06 software and the 
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coding sequence of NTPPH-2, SEQ ID NO: 1 . To inhibit transcription, a complementary 
oligonucleotide is designed from the most unique 5' sequence and used to prevent 
promoter binding to the coding sequence. To inhibit translation, a complementary 
oligonucleotide is designed to prevent ribosomal binding to the NTPPH-2-encoding 
5 transcript. 

X Expression of NTPPH-2 

The cDNA encoding NTPPH-2 is used to express both full-length and truncated 
forms of recombinant NTPPH. Expression of NTPPH-2 is accomplished by subclonirig 
10 the cDNAs into appropriate vectors and transforming the vectors into host cells. In this 
case, the cloning vector was used to express NTPPH-2 in the baculovirus Fast-B AC 
system (GlBCO/BRL). Upstream of the cloning site, this vector contains a promoter for 
polyhedron coat protein. Infection of an insect cell line such as SF9 with the recombinant 
baculovirus results iii the expression of NTPPH-2. Signal residues direct the secretion of 

15 NTPPH-2 into the culture media which can be used directly in the following assay for 

« . . . 

activity. 

XI Demonstration of NTPPH-2 Activity 

Human nucleotide pyrophosphohydrolase-2 activity is analyzed using thymidine 
20 monophosphate paranitrophenyl ester or t 32 ?] gamma labeled ATP as substrate. Media are 
chromatographed and peak fractions are analyzed kinetically as described in Cardenal, A. 
et al. (1996; Arthritis Rheum, 39:252-256.) 

XII Production of NTPPH-2 Specific Antibodies 

25 The amino acid sequence deduced from the cDNA encoding NTPPH-2 is 

analyzed using DNASTAR software (DNASTAR, Inc.) to determine regions of high 
immunogenicity and a corresponding oligopeptide is synthesized and used to raise anti- 
NTPPH-2 antibodies. The selection of appropriate peptide sequences and the techniques 
for antibody production by means known to those of skill in the art. Selection of 

30 appropriate epitopes, such as those near the C-terminus or in hydrophilic regions, is 
described by Ausubel et al. (supra), and others. 

Typically, the oligopeptides are 1 5 residues in length, synthesized using an 
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Applied Biosystems Peptide Synthesizer Model 43 1 A using ftnoc-chemistry, and coupled 
to keyhole limpet hemocyanin (KLH, Sigma, St Louis, MO) by reaction with N- : 
maleimidobenzoyl-N-hydroxysuccinimide ester. (MBS; Ausubel et ah, supra ,) Rabbits are 
immunized with the oligopeptide-KLH complex in complete Freund ! s adjuvant The 
5 resulting antisera are tested for antipeptide activity, e.g., by binding the peptide to plastic, 
blocking with 1% BSA, reacting with rabbit antisera, washing, and reacting with radio 
iodinated, goat anti-rabbit IgG. 

Xm Purification of Naturally Occurring NTPPH-2 Using Specific Antibodies 

1 0 Naturally occurring or recombinant NTPPH-2 is substantially purified by 

immunoaffinity chromatography using antibodies specific tor NTPPH-2. An 
immunoaffinity column is constructed by covalently coupling NTPPH-2 antibody to an 
activated chromatographic resin, such as CNBr-activated Sepharose (Pharmacia & 
Upjohn). After the coupling, the resin is blocked and washed according to the 

15 manufacturer's instructions. 

Media containing NTPPH-2 is passed over the immunoaffinity column, and the 
column is washed under conditions that allow the preferential absorbance of NTPPH-2 
(e.g., high ionic strength buffers in the presence of detergent). The column is eluted under 
conditions that disrupt antibody/NTPPH-2 binding (eg, a buffer of pH 2-3 or a high 

20 concentration of a chaotrope, such as urea or thiocyanate ion), and NTPPH-2 is collected. 

XIV Identification of Molecules Which Interact with NTPPH-2 

NTPPH-2 or biologically active fragments thereof are labeled with 125 I 
. Bolton-Hunter reagent (Bolton, et al. (1973) Biochem. J. 133: 529.) Candidate molecules 
25 previously arrayed in the wells of a multi-well plate are incubated with the labeled 
NTPPH-2, washed and any wells with labeled NTPPH-2 complex are assayed. Data 
obtained using different concentrations of NTPPH-2 are used to calculate values for the 
number, affinity, and association of NTPPH-2 with the candidate molecules. 

Various modifications and variations of the described method and system of the 
30 invention will be apparent to those skilled in the art without departing from the scope and 
spirit of the invention. Although the invention has been described in connection with 
specific preferred embodiments, it should be understood that the invention as claimed 
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should not be unduly limited to such specific embodiments. Indeed, various modifications 
of the described modes for carrying out the invention which are obvious to those skilled in 
molecular biology or related fields are intended to be within the scope of the following 
claims. 
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What is claimed is: 

1 . A substantially purified human nucleotide pyrophosphohydrolase-2 

(NTPPH-2) comprising the amino acid sequence of SEQ ID NO:l or a fragment of SEQ 
5 IDNO:l. 

2. A substantially purified variant of NTPPH-2 having at least 90% amino 
acid identity to the amino acid sequence of claim 1. 

10 3. An isolated and purified polynucleotide encoding the NTPPH-2 of claim 1 . 



4. An isolated and purified polynucleotide variant having at least 90% 
polynucleotide identity to the polynucleotide of claim 3 . 

15 5. A composition comprising the polynucleotide of claim 3. 

6. An isolated and purified polynucleotide which hybridizes under stringent 
conditions to me polynucleotide of claim 3. 

20 7 ; An isolated and purified polynucleotide which is complementary to the 

polynucleotide of claim 3. 

8. An isolated and purified polynucleotide comprising SEQ ID NO:2 or a 
fragment of SEQ ID NO:2; 

25 

9. An isolated and purified polynucleotide of claim 8 which is deposited as 
Accession No. in the American Type Culture Collection. 

10. A fragment ofme polynucleotide ofclaim 8, wherein the fragment is 
30 selected from the group essentially consisting of: 

(a) nucleotides 55 through 75 of SEQ ID NO:2; 

(b) nucleotides 481 through 507 of SEQ ID NO:2; 

-57- 



<WO_893281QA1J_= 



I 



WO 99/32610 PCT/US98/25558 

(c) nucleotides 646 through 669 of SEQ ID NO:2; 

(d) nucleotides 2182 through 4149 of SEQ ID NO:2; 1 

(e) nucleotides 1726 through 4149 of SEQ ID NO:2; 

(f) nucleotides 757 through 4149 of SEQ ID NO:2; and 
5 (g) nucleotides 113 through 4149 of SEQ ID NO:2. 

11. An isolated and purified polynucleotide variant having at least 90% 
polynucleotide identity to the polynucleotide of claim 8. 

io 12. An isolated and purified polynucleotide which is complementary to the 

polynucleotide of claim 8. 

13. An expression vector containing at least a fragment of the polynucleotide of 
claim 3. 

15 

14. A host cell containing the expression vector of claim 13, 

15. A method for producing a polypeptide comprising the amino acid sequence 
of SEQ ID NO:l or a fragment of SEQ ID NO:l, the method comprising the steps of: 

20 (a) culturing the host cell of claim 14 under conditions suitable for the 

expression of the polypeptide; and 

(b) recovering the polypeptide from the host cell culture. 

16. A pharmaceutical composition comprising the NTPPH-2 of claim 1 and a 
25 suitable pharmaceutical carrier. 

17. A purified antibody which specifically binds to the NTPPH-2 of claim 1 . 

18. A purified agonist of the NTPPH-2 of claim 1. 

30 

19. A purified antagonist of the NTPPH-2 of claim 1 . 
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20 - A method for treating an arthropathy comprising administering to a subject 
in need of such treatment an effective amount of the antagonist of claim 18. 

21. A method for treating an immunological disorder comprising administering 
5 to a subject in need of such treatment an effective amount of the antagonist of claim 19. 

22. A method for treating a cancer comprising achhinistering to a subject in 
need of such treatment an effective amount of the antagonist of claim 19. 

1 0 23 - A method for detecting a polynucleotide encodmg.NTPPH-2 in a biological 



sample, the method comprising the steps of: 

(a) hybridizing the polynucleotide of claim 7 to at least one of the 
nucleic acids of the biological sample, thereby forming a hybridization complex; 
and 

15 00 detecting the hybridization complex, wherein the presence of the 

hybridization complex correlates with the presence of a polynucleotide encoding 
NTPPH-2 in the biological sample. 

... • » 

24. The method of claim 23 wherein the nucleic acids of the biological sample 
20 are amplified by the polymerase chain reaction prior to the hybridizing step. 

25. A method for detecting NTPPH-2 in a biological sample, the method 
comprising the steps of: 

(a) combining the biological sample and an antibody of claim 1 7 under 
25 suitable conditions for complex formation; and 

(b) detecting complex formation between NTPPH-2 and the antibody, ' 
thereby establishing the presence of NTPPH-2 in the biological sample. 



30 26. A method for screening a library of small molecules to identify a molecule 

Which binds NTPPH-2, the method comprising the steps of: 

(a) combining the library of small molecules with the polypeptide of 
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claim 1 5 under suitable conditions for complex formation; and 

(b) detecting complex formation, wherein the presence of the colnplex 
identifies a small molecule which binds NTPPH-2 

27. A method for identifying an agonist, the method comprising the steps of: 

(a) delivering one of the small molecules of claim 26 and gamma 
labeled ATP into cells transformed with a vector expressing NTPPH-2; 

(b) growing the cells under suitable conditions; and 

(c) assaying for an increased amount of PPi, thereby establishing that 
the small molecule is an agonist 

28. A method for identifying an antagonist, the method comprising the steps of: 
(a) delivering one of the small molecules, of claim 26 and gamma 

labeled ATP into cells transformed with a vector expressing NTPPH-2; 
15 (b) growing the cells i under suitable conditions; and 

(c) assaying for a decreased amount of PPi, thereby establishing that 
the small molecule is an antagonist. 



5 
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<130> PF-0420 PCT 



<140> 


To Be Assigned 


<141> 


Herewith 


<150> 


08/996,083 


<151> 


1997-12-22 


'<160> 


3 ' 


. <170> 


PERL PROGRAM 


<210> 


1 


. <211> 


1156 


<212> 


PRT 


<213> 


Homo sapiens 


<220> 




<223> 


1388013 


<400> 


1 



Met Ala Ser Leu Leu Pro Leu Leu Cys Leu Cys Val Val Ala Ala 
1 5 10 15 

His Leu Ala Gly Ala Arg Asp Ala Thr Pro Thr Glu Glu Pro Met 
20 25 30 

Ala Thr Ala Leu Gly Leu Glu Arg Arg Ser Val Tyr Thr Gly Gin 
35 40 45 

Pro Ser Pro Ala Leu Glu Asp Trp Glu Glu Ala Ser Glu Trp Thr 
50 55 . 60 

Ser Trp Phe Asn Val Asp His Pro Gly Gly Asp Gly Asp Phe Glu 
65 70 .75 

Ser Leu Ala Ala lie Arg Phe Tyr Tyr Gly Pro Ala Arg Val Cys 
80 85 . 90 

Pro Arg Pro Leu Ala Leu Glu Ala. Arg Thr Thr Asp Trp Ala Leu 
95 100 * 105 

Pro Ser Ala Val Gly Glu Arg Val His Leu Asn. Pro Thr Arg Gly 
110 115 120 

Phe Trp Cys Leu Asn Arg Glu Gin Pro Arg Gly Arg Arg Cys Ser 
.125 130 135 

Asn Tyr His Val .Arg Phe Arg Cys Pro Leu Glu Ala Ser Trp Gly 
140 145 150 

Ala Trp Gly Pro Trp Gly Pro Cys Ser Gly Ser Cys Gly Pro Gly 
155 160 165 

Arg Arg Leu Arg Arg Arg His Cys Pro Ser Pro Ala Gly Asp Ala 
170 .175 180 

Cys Pro Gly Arg Pro Leu Glu Ala Gin Lys Cys Val Arg Pro Arg 
185 190 195 

Cys Pro Gly Cys Ser Leu Asp Thr Cys. Glu Cys Pro Asp His lie 
200 205 210 

Leu Leu Gly Ser Val Val Thr Pro Ser Gly Gin Pro Leu Leu Gly 
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215 










220 








225 


Ala Arg 


Val 


Ser 


Leu 


Arg 


Asp 


Gin 


Pro 


Gly 


Thr 


Val 


Ala 


Thr Ser 






230 










235 








240 


Asp Ala 


His 


Gly 


Thr 
245 


Phe 


Arg 


Val 


Pro 


Gly 
250 


Val 


Cys 


Ala 


Asp Ser 
255 


. Arg Ala 


Asn 


He 


Arg 


Ala 


Gin 


Met 


Asp 


Gly 


Phe 


Ser 


Ala 


Gly Glu 






260 










265 








270 


Ala Gin 


Ala 


Gin 


Ala 
275 


Asn 


Gly 


Ser 


He 


Ser 
280 


Val 


Val 


Thr 


He He 
285 


Leu Asp 


Lys 


Leu 


Glu 
290 


Lys 


Pro 


Tyr 


Leu 


Val 
295 


Lys 


His. 


Pro 


Glu Ser 
300 


Arg Val 


Arg 


Glu 


Ala 
305 


Gly 


Gin 


Asn 


Val 


Thr 
310 


Phe 


Cys 


Cys 


Lys Ala 

• • 315 


Ser Gly 


Thr 


Pro 


Met 
320 


Pro 


Lys 


Lys 


Tyr 


Ser 

325 


Trp 


Phe 


His 


Asn Gly 
330 


Thr Leu 


Leu 


Asp 


Arg 
335 


Arg 


Ala 


His 


Gly 


• Tyr 
340 


Gly 


Ala 


His 


Leu Glu 
345 


Leu Arg 


Gly 


Leu 


Arg 
350 


Pro 


Asp 


Gin 


Ala 


Gly 
355 


He 


Tyr 


His 


Cys Lys 
, 360 


Ala Trp 


Asn 


Glu 


Ala 
365 


Gly 


Ala 


Val 


Arg 


Ser 
370 


Gly 


Thr 


Ala 


Arg Leu 

375 


Thr Val 


Leu Ala Pro Gly Gin 


Pro 


Ala 


Cys 


Asp 


Pro 


Arg 


Pro Arg 








380 










385 








390 


Glu Tyr 


Leu 


He 


Lys 


Leu 


Pro 


Glu 


Asp Cys 


Gly 


Gin 


Pro 


Gly Ser 






395 










400 








405 


Gly. Pro 


Ala 


Tyr Leu Asp Val 


Gly Leu Cys 


Pro 


Asp 


Thr 


Arg Cys 








410 










415 








420 


.. Pro Ser Leu Ala Gly 


Ser 


Ser 


Pro Arg 


Cys 


Gly 


Asp 


Ala 


Ser Ser 








425 










430 








435 


Arg Cys Cys Ser Val Arg Arg 


Leu 


Glu 


Ar-g 


Arg 


Glu 


He 


His Cys 








440 










445 








450 


Pro Gly Tyr Val Leu 


Pro 


Val 


Lys Val Val 


Ala 


Glu 


Cys 


Gly Cys 








455 










460 








465 


Gin Lys Cys 


Leu 


Pro 


Pro Arg 


Gly Leu Val 


Arg 


Gly 


Arg 


Val Val 








470 










475 








460 


Ala Ala Asp Ser Gly Glu Pro 


Leu Arg 


Phe 


Ala 


Arg 


He 


Leu Leu 








485 










490 








495 


Gly Gin Glu 


Pro He Gly Phe 


Thr Ala Tyr 


Gin 


Gly 


Asp 


Phe Thr 








500 










505 








510 


lie Glu 


Val 


Pro 


Pro 


Ser 


Thr. 


Gin Arg Leu 


Val 


Val 


Thr 


Phe Val 








515 










520 








525 


Asp Pro Ser Gly Glu Phe Met 


Asp Ala Val 


Arg 


Val 


Leu 


Pro Phe 








530 










535 








540 


Asp Pro Arg Gly Ala Gly Val 


Tyr 


His 


Glu 


Val 


Lys 


Ala 


Met Arg 








545 










550 








555 


Lys Lys Ala Pro Val 


He 


Leu. 


His 


Thr 


Ser 


Gin 


Ser 


Asn 


Thr He 








560 










565 








570 


Pro Leu 


Gly 


Glu 


Leu 
575 


Glu 


Asp 


Glu 


Ala 


Pro 
580 


Leu 


Gly 


Glu 


Leu Val . 
585 


Leu .Pro Ser Gly Ala Phe Arg 


Arg Ala Asp 


Gly 


Lys 


Pro 


Tyr Ser 








590 










595 








600 


Gly Pro Val Glu Ala Arg Val 


Thr Phe Val 


Asp 


Pro 


Arg 


Asp Leu 








605 










610 








615 


Thr Ser 


Ala 


Ala 


Ser 


Ala 


Pro 


Ser Asp Leu 


Arg 


Phe 


Val 


Asp Ser 








620 










625 








630 


. Asp Gly Glu Leu Ala 


Pro 


Leu 


Arg Thr Tyr 


Gly 


Met 


Phe 


Ser Val 








635 










640 








645 


Asp Leu Arg Ala Pro Gly Ser 


Ala 


Glu 


Gin 


Leu 


Gin 


Val 


Gly Pro 








650 










655 








660 


Val Ala Val Arg Val Ala Ala 


Ser 


Gin 


He 


His 


Met 


Pro 


Gly His 








665 










670 








675 


Val Glu 


Ala 


Leu Lys 


Leu 


Trp 


Ser 


Leu 


Asn 


Pro 


Glu 


Thr 


Gly Leu 








680 










685. 








690 


Trp Glu Glu Glu Ser Gly Phe 


Arg Arg Glu 


Gly 


Ser 


Ser 


Gly Pro 
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, 695 700 705 

Arg Val Arg Arg Glu Glu Arg Val Phe Leu Val Gly Asn Val Glu 
710 715 " 720 

He Arg Glu Arg Arg Leu Phe Asn Leu Asp Val Pro Glu Arg Arq 
7 25 730 735 

Arg Cys Phe Val Lys Val Arg Ala Tyr Ala Asn Asp Lys Phe Thr 
740 745 750 

Pro Ser Glu Gin Val Glu Gly Val Val Val Thr Leu Val Asn Leu 
755 760 765 

Glu Pro Ala Pro Gly Phe Ser Ala Asn Pro Arg Ala Trp Gly Arg 
770 775 780 

Phe Asp Ser Ala Val Thr Gly Pro Asn Gly Ala Cys Leu Pro Ala 
785 790 795 

Phe Cys. Asp Ala Asp Arg Pro Asp Ala Tyr Thr Ala Leu Val Thr 
800 805 8io 

Ala Thr Leu Gly Gly Glu Glu Leu Glu Pro Ala Pro Ser Leu Pro 
815 820 825 

Arg Pro Leu Pro Ala Thr Val Gly Val Thr Gin Pro Tyr Leu Asp 
830 835 840 

Arg Leu Gly Tyr Arg Arg Thr Asp His Asp Asp Pro Ala Phe Lys 
845 850 855 

Arg Asn Gly Phe Ax y lie Asn 5eu Ala Lys Pro Arg Pro Gly Asp 
860 865 870 

Pro Ala Glu Ala Asn Gly Pro Val Tyr Pro Trp Arg Ser Leu Arg 
. 875 880 885 

Glu Cys Gin Gly Ala Pro Val .Thr Ala Ser His Phe Arg Phe Ala 
890 895 900 

Arg Val Glu Ala Asp Lys Tyr Glu Tyr Asn Val Val Pro Phe Arg 
905 910 915 

Glu Gly Thr Pro Ala Ser Trp Thr Gly Asp Leu Leu Ala Trp Trp 
920 925 930 

Pro Asn Pro Gin Glu Phe Arg Ala Cys Phe Leu Lys Val Lys He 
935 940 . 945 

Gin Gly Pro Gin Glu. Tyr Met Val Arg Ser His Asn Ala Gly Gly 
950 955 960 

Ser His Pro Arg Thr Arg Gly Gin Leu Tyr Gly Leu Arg Asp Ala 
.965 970 ~ 975 

Arg Ser Val Arg Asp Pro Glu Arg Pro Gly Thr Ser Ala Ala Cys 
980 985 990 

Val Glu Phe Lys Cys Ser Gly Met Leu Phe Asp Gin Arg Gin Val 
995 .1000 . 1005 

Asp Arg Thr Leu Val Thr He Met Pro Gin Gly Ser Cys Arg Arg 
1010 1015 " 1020 

Val Ala Val Asn Gly Leu Leu Arg Asp Tyr Leu Thr Arg His Pro 
1025 1030 . 1035 

Pro Pro Val Pro Ala Glu Asp Pro Ala Ala Phe Ser Met Leu Ala 
1040 1045 1050 

Pro Leu Asp Pro Leu Gly His Asn Tyr Gly Val Tyr Thr Val Thr 
1055 1060 . 1065 

Asp Gin Ser Pro Arg Leu Ala Lys Glu He Ala lie Gly Arg Cys 
1070 1075 - 1080 

Phe Asp Gly Ser Ser Asp Gly Phe Ser Arg Glu Met Lys Ala Asp 
1085 1090 • 1095 

Ala Gly Thr Ala Val Thr Phe Gin Cys Arg Glu Pro Pro- Ala Gly 
1100 • 1105 mo 

. Arg Pro Ser Leu Phe Gin Arg Leu Leu Glu Ser Pro Ala Thr Ala 
1115 1120 H25 

Leu Gly Asp He Arg Arg Glu Met Ser Glu Ala Ala Gin Ala Gin 
1130 1135 H40 

Ala Arg Ala Ser Gly Pro Leu Arg Thr Arg Arg Gly Arg Val Arg 
1145 1150 H55 

Gin 
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<210> 2 

<211> 4183 

<212> DNA 

<213> Homo sapiens 



unsure 
4180 

a or g or c' or t, unknown, or other 

<220>. - 
<223> 1388013 

<400> 2 

gcccgagcac gccgcggagc ccggacctcc ctcggacgct ctgccccggc catggcgtcg 60 
ctgctgccac tgctctgtct ctgtgtcgtc gctgcgcacc tggcgggggc ccgagacgcc 12 0' 
acccccaccg aggagccaat ggcgactgca ctgggcctgg aaaga,cggtc cgtgtacacc 180 
ggccagccct caccagccct ggaggactgg gaagaggcca gcgagtggac. gtcctggttc 240 
aacgtggacc accccggagg cgacggcgac ttcgagagcc tggctgccat ccgcttctac 300 
tacgggccag cgcgcgtgtg cccgcgaccg ctggcgctgg aggcgcgcac cacggactgg 36Q 
gccctgccgt ccgccgtcgg cgagcgcgtg cacttgaacc ccacgcgcgg cttctggtgc 420 
ctcaaccgcg agcaaccgcg tggccgccgc tgctccaact accacgtgcg cttccgctgc 480 
ccactagaag cctcgtgggg cgcgtggggc ccgtggggtc cctgctcggg gagctgtggg 54 0 
ccaggccgtc gcttgcgccg ccgccactgc ccaagccccg ctggggatgc gtgt.cccggg 600 
cgtcctctgg aggcgcagaa gtgcgtgcgg cctcggtgtc cagggtgcag ccttgacacc 660 
tgtgaatgcc cggaccacat cctcctgggc .tcggtggtca ccccatctgg gcaaccactg 720 
..ctaggagcca gggtctccct gcgagaccag cctggcactg tggccaccag cgatgctcac 780 
ggaaccttcc gggtgcctgg tgtctgtgct gacagccgcg ccaacatcag ggcccagatg' 840 
gatggcttct ctgcagggga ggcccaggcc caggccaacg gatccatctc tgtggtcacc 900 
atcatccttg ataagttgga gaagccgtac ctggtgaaac accctgagtc ccgagtgcga 960 
gaggctggcc agaatgtgac tttctgctgc aaagcctccg ggacccccat gcccaagaaa 1020 
tactcctggt tccacaatgg gaccctgctg gacaggcgag ctcatgggta cggggcccac 1080 
ctggagctsc ggggactgcg cccagaccag gctggcatct accactgcaa ggcatggaat 114Q 
gaggcgggtg ccgtgcgctc gggcactgcc cggctcactg tacttgcccc aggccagcca 1200 
gcctgcgacc cccggccccg agagtacctg atcaagctcc ctgaggactg tggtcagcca 1260 
ggtagtggcc ctgcctacct ggatgtgggc ctctgtcccg acacccgctg ccccagcctg 1320 
gcaggctcca gcccccgctg cggggacgcc agctcccgct gctgctctgt gcgccgtctg 1380 
gagagaaggg agattcactg ccctggctac gtcctcccag tgaaggtggt ggcagagtgt 1440 
ggctgccaga agtgtctgcc ccctcggggg ctggtccggg gccgtgttgt ggctgctgac 1500 
tccggggagc cgctacgctt cgccaggatt ctgctgggcc aggagcccat cggcttcacc 1560 
gcctaccagg gcgactttac cattgaggtg ccgccctcca cccagcggct ggtggtgact 1620 
tttgtggacc ccagcggtga gttcatggac gctgtccggg tcttgccttt tgatcctcga 1680 
ggtgccggcg tgtaccacga ggtcaaggcc atgcggaaga aagccccggt cattttacat 1740 
accagccaga gcaacacgat ccccctgggc gagctggaag atgaggcgcc cctgggcgag 1800 
ctggtcctgc cttctggcgc tttccgcaga gccgacggca aaccctactc ggggcctgtg 1860 
gaggcccggg tgacgttcgt ggacccccga gacctcacct cggcggcgtc tgcccccagt 1920 
gacctgcgct tcgtggacag cgacggcgag ctggctccac tgcgcaccta cggcatgttc 1980 
tccgtggacc tccgtgcgcc cggctccgcg gagcagctgc aggtggggcc ggtggccgtg 2040 
cgggtggccg ccagccagat ccacatgcca ggccacgtgg aggccctcaa gctgtggtisg 2100 
ctgaaccccg agaccggctt gtgggaggag gagagcggct tccggcgcga ggggtcctcg 2160 
ggcccccggg tgcgccggga ggagcgcgtc ttcctggtgg gcaacgtgga gatccgggag 2220 
cggcgcctgt tcaatctgga cgtgcctgag cgccgccgct gcttcgtgaa ggtgcgcgcc 2280 
tacgccaacg acaagttcac ccccagcgag caggtggagg gcgtggtggt cacgctggtc 2340 
aatctggagc ccgcccccgg cttctccgcc aacccccgtg cctggggccg ctttgacagc 2400 
gcggtcaccg gccccaatgg cgcctgcctc cccgccttct gcgacgccga caggccagac 2460 
gcctacaccg ccctggtcac cgccaccctg ggcggcgagg agctggagcc ggccccttcc 2520 
ttgccccgcc cactcccggc caccgtgggc gtcacccagc cctacctgga caggctgggg 2580 
taccgtcgga cggaccacga cgatcccgcc ttcaagcgta acggcttccg catcaacctc 2640 
gccaagccca ggccaggtga ccccgccgag gccaatgggc ctgtgtaccc gtggcgcagc 2700 
ctgcgggaat gccagggggc cccggtgact gccagccact tccgcttcgc cagggtggag 27 60 
gcggacaagt acgagtacaa cgtggtcccc ttccgagagg gcacacctgc ctcctggact 2820 
ggcgatctcc tggcctggtg gcccaacccg caggagttcc gggcctgctt cctcaaggtg 2880 
aagatccagg gtccccagga gtatatggtc cgctcccaca acgcaggggg cagccaccca 2940 
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cgcacccgcg gccagctcta cggacttc'gg gatgcccgga gtgtgcgaga ccccgag.cgt 3000 
ccgggcaect cggcagcctg cgtggagttc aagtgcagcg ggatgctgtt cgaccagcgg 3060 
caggtggaca ggacgctggt gaccattatg ccccagggca gctgccggcg cgtggccgtc 3120 
aacggactcc ttcgggatta cctgacccgg caccccccac cggtgcccgc ggaggaccca 3180 j 
gctgccttct ccatgctggc ccccctagac cctctgggcc acaactatgg cgtctacact 3240 
gtcactgacc agagcccacg cttggccaag gagatcgcca ttggccgctg ctttgatggt -3300 
tcctctgacg gyttctccag agagatgaag gctgatgccg gcacagccgt caccttccag 3360 
tgccgggagc caccggccgg acgacccagc ct'cttccaga ggctgctgga gtccccggcg 3420 
acagcacttg gtgacatccg cagggagatg agcgaggcgg cgcaggcaca ggcccgggcc 3480 . 
tcaggtcccc tccgcacccg ccggggtagg gtccggcagt gacctgggca ggggcctcgc 3540. 
tttcccacct ccctccagac tcctttgacc ccaggaagtt ttgcccctcc ttcttctcca 3600 
gacagccccc tccccaggtg tctgggtccc ctttcccgcc cctttccaga actcagagtc 3660 
agacaagaac ccagagcatc cgatggtaga aacaccagga agacaattgt tgctgtgtgg 3720 
tatggaatgg agtttgcggt gactctgggg ccagcaccca ggggacgacg . ttcaacccta 3780 
gcctgaaggg acccgctccc agctcagaag ccgtctctga cttctcgtgc gtattttgac 3840 
cctgatttca atcttctacc cttgggagtt ctggcgtttg gcacaaagtc ccctctgcct ' 3900 
gtttggagct cagtgctaga ccaggtccqc tgccccgagc tttgtttttg gggttattta 3960 
ttgaaacaaa gtgtggggag ctggttgtgg gtgtgagtgg gggtgtgggg tccaggctgg 4020 
gcccagtgaa aaggaggaag gggttcccat gcgggggagg ctctggggct gaggggaaca 4080 
attctcacgt gtttggtgct tagagacctg cccggggcgt tgggcaggcc ctccgggggc 4140 
tgaattaaaa atgctttatt tccaaaaaaa aaaaaaaaan aaa "41 83 



<210> 3 

<211> 1156 

<212> PRT 

<213> Homo sapiens 

<220> - 
<223> 422069 



Cys Leu Cys Val Val Ala Ala 

10 15 

Thr Pro Thr Glu Glu Pro Met 

25 30 

Arg Ser Val Tyr Thr Gly Gin 

40. 45 

Glu Glu Ala Ser Glu Trp Thr 

55 60 

Gly Gly Asp Gly Asp Phe Glu 

70 . 75 

Tyr Gly Pro Ala Arg Val Cys 

85 90 

Arg Thr Thr Asp Trp Ala L6u 

100 105 
His Leu Asn Pro Thr Arg Gly 

11S 120 

Pro Arg Gly Arg Arg Cys Ser 

130 135 

Pro Leu Glu Ala Ser Trp Gly 

145 150 

Ser Gly Ser Cys Gly Pro Gly 

160 165 • 

Pro Ser Pro Ala Gly Asp Ala 

175 . 180 

Gin Lys Cys Val Arg Pro Arg 

.190 " 195. 

Cys Glu Cys Pro Asp His lie 

'205 210 
Ser Gly Gin Pro Leu Leu Gly * 

220 225 

Pro Gly Thr Val Ala Thr Ser. 

235 240 



<400> 3 


Met 


Ala 


1 




His 


Leu 


Ala 


Thr 


Pro 


Ser 


Ser 


Trp 


Ser 


Leu 


Pro 


Arg 


Pro 


Ser 


Phe 


Trp 


Asn 


Tyr 


Ala 


Trp 


Arg 


Arg 


Cys 


Pro 


Cys 


Pro 


Leu 


Leu 


Ala 


Arg 



Ser Leu 
Ala Gly 
Ala Leu 
Pro Ala 
Phe Asn 
Ala Ala 
Pro Leu 
Ala Val 
Cys Leu 
His Val 
Gly Pro 
Leu Arg 
Gly Arg 
Gly Cys 
Gly Ser 
Val Ser 



Leu 


Pro 


5 




Ala Arg 


20 




Gly 


Leu 


35 




Leu 


Glu 


50 




Val 


Asp. 


65 




He 


Arg 


80 




Ala 


Leu 


95 




Gly 


Glu 


110 




Asn 


Arg 


125 




Arg 


Phe 


140 




Trp Gly 


155 




Arg Arg 


170 




Pro 


Leu 


185 




Ser 


Leu 


200 




Val 


Val 


215 




Leu 


Arg 


230 





Leu Leu 
Asp Ala 
Glu Arg 
Asp Trp 
His Pro 
Phe Tyr 
Glu Ala 
Arg Val 
Glu Gin 
Arg Cys 
Pro Cys 
His Cys 
Glu Ala 
Asp Thr 
Thr Pro 
Asp Gin 
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. Asp 


Ala 


His 


Gly 


Thr 


Phe 


Arg 


Val 


Pro 


Gly 


Val 


Cys 


Ala 


Asp 


Ser 








245 










250 










255 


Arg 


Ala 


Asn 


He 


Arg. 


Ala 


Gin 


Met 


Asp 


Gly 


Phe 


'Ser 


Ala 


Gly 


Glu 








.260 










265 










270 


. Ala 


Gin 


Ala 


Gin 


Ala 

275 


Asn 


Gly 


Ser 


He 


Ser 
280 


Val 


Val 


Thr 


He 


He 
285 


Leu 


Asp 


Lys 


Leu 


Glu 


Lys 


Pro 


Tyr 


Leu 


Val 


Lys 


His 


Pro 


Glu 


Ser 






.290 










295 










300 


Arg 


Val 


Arg 


Glu 


Ala 


Gly 


Gin 


Asn 


Val 


Thr 


Phe 


Cys 


Cys 


Lys 


Ala 








305 








310 










315 


Ser 


Gly 


Thr 


Pro 


Met 


Pro 


Lys 


Lys 


Tyr 


Ser 


Trp 


Phe 


His 


Asn 


Gly 








320 










325 










330 


Thr 


Leu 


Leu 


Asp 


Arg 
335 


Arg 


Ala 


His 


Gly 


Tyr 
340 


Gly 


Ala 


His 


Leu 


Glu 
345 


Leu 


Arg 


Gly 


Leu 


Arg 
350 


Pro 


Asp 


Gin 


Ala 


Gly 

•355 


He 


Tyr 


His 


Cys 


Lys 
360 


Ala 


Trp 


Asn 


Glu 


Ala 


Gly 


Ala 


Val 


Arg 


Ser 


Gly 


Thr 


Ala 


Arg 


Leu 








365 










370 










375 


Thr 


Val 


Leu 


Ala 


Pro 
380 


Gly 


Gin 


Pro 


Ala 


Cys 
385 


Asp 


Pro 


Arg 


Pro 


Arg 
390 


Glu 


Tyr 


Leu 


He 


Lys 


Leu 


Pro 


Glu 


Asp 


Cys 


Gly 


Gin 


Pro 


Gly 


Ser 



395 ----- - 40Q 4Q5 

Gly Pro Ala Tyr Leu Asp Val Gly Leu Cvs Pro Asp Thr Arg Cys 

410 ' 43.5 420 

Pro Ser Leu Ala Gly Ser Ser Pro Arg Cys Gly Asp Ala Ser Ser 

425 430 435 

Arg Cys Cys Ser Val Arg Arg Leu Glu Arg Arg Glu He His . Cys 

440 445 450 

Pro Gly Tyr Val Leu. Pro Val Lys Val Val Ala Glu Cys. Gly Cys 

455 460 465. 

Gin Lys Cys Leu Pro Pro Arg Gly Leu Val Arg Gly Arg Val Val 

470 475 .480 

Ala Ala Asp Ser Gly Glu Pro Leu Arg Phe Ala Arg He Leu . Leu 
., 485 490 495 

Giy Gin Glu Pro He Gly Phe Thr Ala Tyr Gin Gly Asp Phe Thr 

500 505 510 

He Glu Val Pro Pro Ser Thr Gin Arg' Leu Val Val Thr Phe Val 

515 520 525 

Asp Pro Ser Giy Glu Phe Met Asp Ala Val Arg Val Leu Pro Phe 

530 53.5 540 

Asp Pro Arg Gly Ala Gly Val Tyr His Glu Val Lys Ala Met Arg 

545 550 555 

Lys Lys Ala Pro Val He Leu His Thr Ser Gin Ser Asn Thr He 

560 565 570 

Pro Leu Gly Glu Leu Glu Asp Glu Ala Pro Leu Gly Glu Leu Val 

575 580 585 

* Leu Pro Ser Gly Ala Phe Arg Arg Ala Asp Gly Lys Pro Tyr Ser. 

590 595 600 

Gly Pro Val Glu Ala Arg Val. Thr Phe Val Asp. Pro Arg Asp Leu 

605 • 610 615 

Thr Ser Ala Ala Ser Ala Pro Ser Asp Leu Arg Phe Val Asp Ser 

620 625 630 

Asp Gly Glu Leu Ala Pro Leu Arg Thr Tyr Gly Met Phe Ser Val 

635 640 645 

Asp Leu Arg Ala Pro Gly Ser Ala Glu Gin Leu Gin Val Gly Pro 

650 655 660 

Val Ala Val Arg Val Ala Ala Ser Gin He His Met Pro Gly His 

665 670 675 

Val Glu Ala Leu Lys Leu Trp Ser Leu Asn Pro Glu Thr Gly Leu 

680 685 690 

Trp Glu Glu Glu Ser Gly Phe Arg Arg Glu Gly Ser Ser Gly Pro 

695 7Q0 . 705 

Arg Val Arg Arg Glu Glu. Arg Val Phe Leu. Val Gly Asn Val Glu 

710 715 720 

6/7 



CIO <WO 9932610A1J_> 



WO 99/32610 PCT/US98/25558 



\ lie 
i 


Arg Glu 


Arg Arg 


Leu 


Phe 


Asn Leu Asp 


Val 


Pro 


Glu 


Arg Arg 






725 








730 








735 


Arg 


Cys 


Phe 


Val Lys 


Val 


Arg 


Ala Tyr. Ala 


Asn 


Asp 


Lys 


Phe .Thr 








740 








745 




750 


Pro 


Ser 


Glu 


Gin Val 


Glu 


Gly 


Val Val 


Val 


Thr 


Leu 


Val 


Asn Leu 


1 






755 








760 








765 


Glu 


Pro 


Ala 


Pro Gly 


Phe 


Ser 


Ala Asn. Pro 


Arg 


Ala 


Trp 


Gly Arg 








770 








775 








780 


Phe 


Asp 


Ser 


Ala Val 


Thr 


Gly 


Pro Asn 


Gly 


Ala 


Cys 


Leu 


Pro Ala 








785 








790 






795 . 


Phe 


Cys Asp 


Ala Asp 


Arg 


Pro 


Asp Ala 


Tyr 


Thr 


Ala 


Leu 


Val Thr ' 








' 800 






805 








810 


Ala 


Thr Leu 


Gly Gly 


Glu 


Glu 


Leu Glu 


Pro 


Ala 


Pro 


Ser 


Leu ' Pro 








815 








820 








; 825 


Arg 


Pro 


Leu 


Pro Ala 


Thr 


Val 


Gly Val 


Thr 


Gin 


Pro 


Tyr 


Leu Asp ■ 








830 








835 






840 


Arg 


Leu Gly 


Tyr Arg 


Arg 


Thr 


Asp His 


Asp 


Asp 


Pro' 


Ala 


Phe Lys 








845 








850 








855 


Arg 


Asn Gly 


Phe Arg 


He 


Asn 


Leu Ala 


Lys 


Pro 


Arg 


Pro 


Glv Aso 


• 






860 




* 




865 








870 


Pro 






Ala Asn 


Gly 


Pro 


Val Tyr 


Pro 


Trp 


Arg 


Ser 


Leu Arg 








8^5- 


















Glu 




m n 


Gly Ala 


Pro 


Val 


Thr Ala 


Ser 


His. 


Phe Arg 


Phe Ala 








' 890 








895 








900 


. Arg 


VoX 


ul U 


Ala Asp 


Lys 


Tyr 


Glu Tyr 


Asn 


Val. .Val 


Pro 


Phe Arg' 








905 








910 








915 


Glu 


ft! V 




Pro Ala 


Ser 


Trp 


Thr Gly 


Asp 


Leu 


Leu Ala 


Trp Trp 








920 








925 








930 


Pro 


&en 


Pro 


Gin Glu 


Phe Arg 


Ala Cys 


Phe 


Leu Lys Val 


Lys He 








935 








940 








945 


Gin 


Glv 


Prn 


Gin Glu 


Tyr Met 


Val Arg 


Ser 


His 


Asn 


Ala 


Gly Gly 








.950 








955 








960 


Ser 


His 


Pro 


Arg Thr 


Arg Gly 


Gin Leu 


Tyr 


Gly Leu Arg 


Asp Ala 








965 








970 








975 


Arg 


Ser 


Val 


Arg Asp 


Pro 


Glu 


Arg Pro 


Gly 


Thr 


Ser Ala 


Ala Cys 








980 








985 








990 


Val 


Glu 


Phe 


Lys Cys 


Ser Gly 


Met Leu 


Phe 


Asp Gin Arg 


Gin Val 








995 






1000- 








1005 


Asp 


Arg 


Thr 


Leu Val 


Thr lie 


Met Pro 


Gin 


Gly Ser Cys 


Arg Arg 








1010. 






1015 








1020 


Val 


Ala 


Val 


Asn Gly 


Leu 


Leu 


Arg Asp Tyr 


Leu Thr Arg 


His Pro 








1025 






1030 








1035 


Pro 


Pro 


Val 


Pro Ala 


Glu Asp 


Pro Ala 


Ala 


Phe 


Ser 


Met 


Leu. Ala 








1040 






. 1045 








1050 


Pro 


Leu 


Asp 


Pro Leu 


Gly His 


Asn Tyr 


Gly 


Val 


Tyr Thr 


Val Thr 








1055 






1060 








1065 


Asp 


Gin 


Ser 


Pro Arg 


Leu 


Ala 


Lys Glu 


He 


Ala He Gly 


Ara Cvs 








1070 






1075 








1080 


Phe 


Asp 


Gly 


Ser Ser 


Asp Gly 


Phe Ser Arg 


Glu .Met Lys 


Ala Asp 








1085 






1090 








1095 


' Ala 


Gly 


Thr 


Ala Val 


Thr 


Phe 


Gin Cys Arg 


Glu 


Pro 


Pro 


Ala. Gly 








1100 






1105 








1110 


Arg 


Pro 


Ser 


Leu Phe 


Gin Arg 


Leu Leu 


Glu 


Ser 


Pro Ala 


Thr Ala 








1115 






1120 








1125 


Leu 


Gly 


Asp 


lie Arg 


Arg Glu 


Met Ser 


Glu 


Ala Ala Gin Ala Gin 






1130 






1135 








1140 


•Ala 


Arg 


Ala 


Ser Gly 


Pro 


Leu 


Arg Thr Arg 


Arg Gly Arg Val Arg 






1145 








1150 








' 1155 



Gin 
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